Curtains for jets?


Emissions from Saturn’s moon Enceladus explained
as curtain eruptions — and ‘phantom’ jets.
Splice of life


Researchers, bioethicists and regulators must contribute to transparent discussions on the risks
and ethics of editing human embryos.


The news last month that scientists had edited the genomes of
human embryos induced a predictable sharp intake of breath
(see Nature http://doi.org/3xt; 2015). The work is notable
because it altered the germ line, meaning that in a viable embryo, the
genetic changes would have been passed on to all future offspring.
What should be society’s response to such research? How should the
scientific community view other current and foreseeable experiments
along similar lines, and what should it do about them?

Gene-editing tools have evolved to the point at which targeted
changes to a genome can be made with unprecedented ease. In theory,
gene editing allows specific genetic traits to be changed. The potential
clinical applications, in which babies are engineered so that they no
longer carry faulty, disease-causing genes that run in the family, might
be attractive to many. But even such potentially legitimate clinical
applications remain some way off. There are also longer-term ethical
concerns that germline gene therapy might creep beyond eliminating
deadly or debilitating heritable disorders to include disabilities, less
serious conditions, and cosmetic and other supposed enhancements
— leading to ‘designer babies’ and raising the spectre of eugenics.

Now is a good time for a public debate about such human germline
editing — gene editing in sperm, eggs or embryos applied in ways that
would allow changes to propagate to subsequent generations. Not only
should voices from civil society outside the closeted worlds of science,
bioethics and regulation be heard, but their highly diverse viewpoints
must also help to set the terms of the debate. The accumulated knowledge
and experience of the relevant academic disciplines and regulators
needs to be taken into account. Ultimately, such debates should be
resolved with international discussion, and regulation at national levels.


PRACTICAL CONSIDERATIONS 

The latest research, published in Protein & Cell, demonstrates the 
issues (P. Liang et al. Protein Cell http://doi.org/34q; 2015). The 
researchers deliberately used embryos that were products of eggs 
fertilized by two sperm, and so could never grow into a baby. The 
details of the work highlight why attempting human germline gene 
therapy using editing techniques any time soon would be a terrible 
mistake. The efficiency of genetic modification turned out to be poor, 
and the technique generated many unintended mutations. It could be 
a long time before researchers can demonstrate that the benefits of the
procedure would outweigh the risks. Until such a time, it is clear that 
no sensible laboratory, regulator or nation should even consider any 
attempt to implant and develop to birth an edited embryo. 

The potential power and relative ease of gene editing offer compel- 
ling reasons to support such research, however. The latest work, for 
example, aimed to edit a gene that when mutated is responsible for 
the blood disorder β-thalassaemia. (The gene also helps to protect
against Plasmodium falciparum malaria.) Extending the research
could help us to understand and treat the blood disorder, forms of
which can be severe. There is also a strong basic-science incentive for
such experiments, which can help us to understand human develop- 
ment and perhaps be used to produce useful cell lines. A total ban on 
research would therefore seem counterproductive. 

But there is also a need to keep germline gene therapy in perspective. 
Preimplantation genetic diagnosis and selection of healthy embryos 
during in vitro fertilization already provides a safer alternative for
avoiding genetic disease in newborns — as can prenatal screening and
abortion. The diseases for which gene editing would be superior are few.


Many countries ban or restrict research that 
involves the destruction of human embryos, 
and moreover bar human germline modification. Even in countries 
with more-liberal laws, there is a de facto ban on gene editing as part of 
a human-reproduction technology, because the safety and efficacy of 
such work would not meet existing clinical-trial standards. Debates on 
other genetic-engineering topics such as recombinant DNA and somatic 
cloning have touched on many of the issues relevant to germline editing. 
What usually emerges from such discussions is a green light for properly 
regulated research, with tight restrictions on how that research could be 
applied. The same outcome seems the most sensible here, and probably 
the most likely, in light of the embryo-editing work. 

But all involved must actively work to make that happen, and 
not passively assume that the field will simply evolve towards best 
practice. How should a more general discussion proceed? Whether 
in collaboration or separately, national governments need to step up 
on this issue. Scientists, companies and ethicists are already voicing 
their views and setting up further meetings. 

Also helpful might be an official forum of experts to assist emergent 
policy discussions — an international meeting of scientists, regulators, 
ethicists and representatives of civil society, perhaps convened under 
the auspices of the World Health Organization (WHO). Such a meeting 
could take stock of the state of the science of gene editing, and articulate 
the regulatory and ethical landscapes. It could then quickly move to help 
close any gaps in legislation, and develop a regulatory framework for 
the inevitable germline-related advances in gene-editing techniques. 

A model perhaps is a similar meeting convened by the WHO last 
year to rapidly assess the ethics of emergency clinical trials of Ebola 
drugs and vaccines that had not been fully tested for their safety. As 
with Ebola, any meeting on germline gene editing should also be given 
access to the unpublished results of ongoing experiments. 

Transparent and inclusive discussion of issues raised by gene-editing 
technologies that could open the door to germline gene therapy is a 
must. Scientifically and ethically informed contributions would remind 
people that for the foreseeable future, science-fiction scenarios of 
‘designer babies’ remain just that, while providing an articulation of the 
limitations of our scientific understanding.

Dirty money


The fossil-fuel divestment campaign raises 
important questions but offers few solutions. 


A global campaign to persuade organizations to sell their
shares in fossil-fuel firms is gathering momentum. The
divestment movement argues that universities and other
institutions should not even tacitly support the companies that sell
the products responsible for greenhouse gases. It is a moral argument
that carries particular weight with universities, which have
helped to spell out the dangers of emissions, but one that fails as a
serious strategy for tackling climate change.

These are complex issues, as demonstrated by recent conflicting decisions
at three universities that have considered divestment. As we report
on page 16, SOAS University of London announced on 24 April that it
would give up all of its fossil-fuel investments in the next three years.
Six days later, the New York University (NYU) senate adopted a resolution
calling on the university to maintain its investments in some 200
companies that deal in fossil fuels, but to develop a greener investment
strategy for the future. The resolution sought to prevent further investment
in fossil fuels, but said that divestment of existing investments
would reduce returns from the university’s US$3.4-billion endowment.
And on 2 May, Swarthmore College in Pennsylvania announced that it
would create an alternative fund that is free of fossil-fuel investments,
instead of relinquishing investments from its existing endowment.

A report by an NYU senate committee says that $139.7 million —
4.1% of the school’s endowment — is invested in the 200 fossil-fuel
companies of concern. But just $700,000 falls under the direct control
of the university’s fund managers. The rest is invested through
external brokers who typically purchase shares in funds with a broad
portfolio of assets. As a consequence, divesting the $139 million
would mean selling off and reinvesting shares worth $1.3 billion. It
could be done, but the committee unanimously recommended that
the senate hold on to these investments.

NYU is hardly unique. Just about anybody who has a retirement 
account or owns shares in a mutual fund is likely to find themselves 
in the same situation. And even if they do not, fossil fuels have a role 
in pretty much every facet of their daily lives, from the electricity that 
they use to the food they eat, the home they live in and the transpor- 
tation they take to work — bicycles and public transport included. 

Divestment is a complicated affair, and avoiding the worldly benefits
that fossil fuels offer to citizens in developed countries is
downright impossible.

So far, at least 28 universities have taken a stand against fossil fuels,
but the benefits of institutional divestment are not clear. Universities
that sell their shares in fossil-fuel
companies must find buyers, so the most that they can hope to achieve 
is to push down the stock price a little. But the world still runs on fossil 
fuels, and until a better option arises, the current business model will 
surely continue. Furthermore, where would the universities put the cash 
that they have freed up through divestment? Extra investment is always 
welcome, but it is not at all clear that the relatively small clean-energy 
sector could absorb a cash infusion of the scale under discussion here. 

In the end, even advocates of divestment admit that the main purpose 
of the campaign is to raise awareness. The movement is a by-product 
of the fact that governments have been slow to act, and frustration is 
understandable. The question is how to harness that angry energy — 
without further polarizing the debate. This is a collective problem, and 
vilifying the fossil-fuel industry merely displaces blame. 

The real challenge is to bolster the science and implement effective 
public policies that will drive all investments in the right direction. 
Fossil-fuel companies must play their part, and those that do not 
may ultimately succumb to a new generation of energy companies. 
All investors will need to negotiate this transition, which comes with 
both risks and huge opportunities. But the primary role of universi- 
ties, irrespective of how they choose to invest their endowments, is 
to conduct research, inform public policy and educate the leaders of 
the future.


Greek cash grab 


Government’s decision to plunder university 
funds shows lack of respect for science. 


The appointment of physicist Costas Fotakis as Greek science
minister had some researchers hoping that the incoming government,
led by Alexis Tsipras, was determined to save Greek
science from the vortex of the country’s debt crisis. Some hope. The
dominant mood among the nation’s scientists now is outrage at new
measures that raid research funds in a bid to stave off financial collapse.
A decree approved on 24 April forces Greek universities and
research centres to transfer any cash reserves they still hold to the
Greek central bank.

Faced with horrendous capital flight from Greece, the government
intends to use the funds — money that research organizations have
set aside to pay electricity bills and other overhead costs not covered
by external grants — to meet pressing financial obligations to international
creditors.

Athens’s high-handed grip on disposable university budgets is
unlikely to generate any substantial revenue. Instead, the ill-conceived
move threatens to rekindle Greek scientists’ long-held aversion to the
state. To balance the books, many observers point out, Greece would be
better taking aim at the long-standing problem of poor tax compliance.

The government has assured outraged university rectors that the
measure is temporary, and that universities can expect to get their 
money back without losses. And in the worst case — if Greece were to 
default on its loans and had to leave the eurozone — money deposited 
at private banks would be more at risk than deposits with the Bank of 
Greece, says Fotakis. 

That may be so. But handing over research money to a government 
that has yet to prove its fiscal competence comes with a risk of its own. 
And with massive budget cuts over the past five years, Greek science 
has already paid its fair share towards solving the debt crisis. 

The decree is not the only problem. A controversial bill proposed last 
month would reverse some of the reforms introduced by the previous 
government to bring Greece's science and higher-education system in 
line with European norms. The government should be wary of changes 
that clash with attempts to make Greek universities more attractive to 
foreign students and scientists. 

With a research expenditure of less than 0.6% of gross domes- 
tic product, Greece is one of the least science-friendly countries in 
Europe. Strengthening science is essential to stimulate economic 
growth and create the jobs that Greece so urgently needs. 

If the government takes science seriously — and Fotakis’s appoint- 
ment was a signal that it does — it should scrap the idea of borrowing 
money from cash-strapped research organi- 
zations. Political farsightedness and respect 
for science — a profession of truly Greek ori- 
gin — demand that the detrimental decree be 
reversed.

Keep the directive that
protects research animals


Losing legislation that has animal welfare at its core would not just jeopardize
science, it is also likely to lead to a drop in standards, argues Kay Davies.


I have spent years meticulously trying to understand the disease
process in Duchenne muscular dystrophy (DMD) and using that
information to develop treatments. DMD is one of the most devastating
childhood genetic disorders in the world. There is currently
no effective treatment, so there is a huge unmet clinical need. Every so
often, we get an unexpected finding that leads us to a new approach to
therapy. One of our discoveries has led to clinical trials of a candidate
treatment. This drug is promising because it targets the underlying
cause of the disease.

Between the painstaking cell-line analysis and trials in humans
there is a crucial step: validation of our theories in mice. Our work
in cell lines, although useful, gives no indication of how effectively
a treatment will combat the disease in crucial muscles such as the
diaphragm and heart, given that most patients
die from either respiratory or cardiac defects.

Even though personal experience as a scientist
teaches me that, as things stand, no progress
can be made in improving human health without
animal research, the decision to use animals
should never be taken lightly. It is extremely
important that the research is highly regulated
and that we have an open and honest discussion
about it. As with all research, experiments
involving animals should not proceed without
the broad consent of society.

It is for this reason that I am worried about
the Stop Vivisection European Citizens’ Initiative.
The Italian-based initiative wants to scrap
animal research in Europe by repealing a European
directive that protects animals used for
science. It has gathered more than one million signatures, and so
gets to present its case next week to the European Parliament, which
is largely composed of new members.

The directive was introduced in 2010 after a long and considered
process of debate with a wide range of groups, and came into full
effect in 2013. It is ambitious legislation that requires the raising of
welfare standards across the continent, and obliges licensed researchers
to adopt the ‘3Rs’: reduction, refinement and replacement.

My lab in Oxford, UK, has to renew its licence to carry out
research in animals from the local committee every five years,
with a review halfway through. Welfare is an extremely important
concern. We have found that the best way to continually improve
standards is for my research team to work closely with animal-house
staff to explain the experiments being carried out and why they are
being done. In this way, those who look after
the animals every day understand what effects
the animals might be experiencing, and can
therefore tailor their care.

All funders, including the European
Commission, want as few animals as possible to be involved in
research. This is not always easy, because doing experiments with 
too few animals can be just as damaging as using too many. If the 
results are not significant because the sample sizes are too small, then 
the work and the animals have been wasted. We need to justify, in 
statistical terms, the appropriate number of animals. 

In my view, technology is continually helping us to reduce the num- 
ber of animals needed. Already in my lab, we are developing imaging 
technology that enables us to monitor the response to candidate drugs 
in mice, so experiments can be shorter, less invasive and use fewer 
animals. Other advances, such as tissue engineering, stem-cell tech- 
nology and computer modelling, also show promise for refining the 
use of animals and providing potential alternatives. On this score, the 
directive has placed further duties on the Euro- 
pean Centre for the Validation of Alternative 
Methods, a body that makes sure that substitute 
methodologies are at least as reliable as equiva- 
lent methods using animals. Since 1990, it has 
validated more than 30 such alternatives. 

Removal of the directive would be a signifi- 
cant step backward both for animal welfare in 
the European Union and for Europe's leading 
role in advancing human and animal health. 
Biomedical researchers have a duty to continue 
trying to develop treatments, and there is always 
what I think of as a ‘tipping point’ between our 
confidence in our experimental findings and 
starting human trials. We want treatments to be 
available for humans as soon as possible and, as 
we develop alternative testing techniques, this 
will shift further and further towards being possible with very little 
experimentation in animals. It would be wonderful to see the day 
when we can do experimental medicine in small groups of patients 
without validation in animals first, but we are not there yet. It is sim- 
ply too great a risk to patients. 

There are always going to be those who disagree with animal
research, and the scientific community must show where, perhaps 
unexpectedly, we share common ground. Like those who disagree 
entirely, we do not want animals to be used in research forever, and 
we must make our thinking behind the 3Rs and alternatives better 
understood. The directive makes a massive contribution to animal 
welfare, and allows experiments that are necessary to realizing the 
medical benefits of my research and that of countless other European 
scientists. Abolishing it would not be good for basic science, medical 
progress or animal welfare.


Kay Davies is director of the MRC Functional Genomics Unit at the 
University of Oxford, UK. 
e-mail: kay.davies@dpag.ox.ac.uk 
A fix for faulty
mitochondria 


Using genome editing to target 
faulty DNA in mitochondria 
— the cell’s powerhouses — 
could prevent the inheritance 
of mitochondrial defects that 
cause disease. 

Juan Carlos Izpisua 
Belmonte at the Salk Institute 
for Biological Studies in 
La Jolla, California, and his 
colleagues injected RNA 
into mouse embryos and 
eggs containing a mix of 
mitochondria from two 
different mouse strains. The 
RNA was programmed to 
produce an engineered enzyme 
that targets and cuts DNA 
from only one mouse strain. 
Around 60% of the targeted 
mitochondrial DNA was 
destroyed. 

The researchers also reduced 
the levels of faulty DNA in 
mouse eggs that contained 
mitochondria from humans 
with a mitochondrial disorder. 
Eventually, similar techniques 
might be used to prevent the 
transmission of such diseases 
while avoiding a controversial 
method that makes eggs 
containing mitochondrial 
DNA from two women. 

Cell 161, 459-469 (2015) 


Selections from the
scientific literature


Night lights bring a sea change


Light pollution may be driving changes in
marine ecosystems, attracting some species and
discouraging others.

Thomas Davies at the University of Exeter, UK,
and his colleagues placed plastic panels in the sea
off north Wales and illuminated some with white
light-emitting diodes (LEDs) at night while leaving
others in the dark. After 12 weeks, the lit and unlit
panels had been colonized by markedly different
animals. Those influenced by light included both
mobile and fixed species. Illumination reduced
the numbers of hydroid Plumularia setacea,
while the tube-building worm Spirobranchus
lamarcki and the mobile crustacean Metis ignea
seemed to be attracted by light. A rise in the use
of LED lights, which emit a broader range of
wavelengths than older lighting technologies,
could increase the rate of marine ecological
changes driven by light pollution.

Biol. Lett. http://doi.org/36n (2015)


Targeting multiple
myeloma


A new way of blocking
enzymes that destroy
faulty proteins could fight
the blood cancer multiple
myeloma.

Proteasomes are enzyme
complexes that degrade
misfolded proteins, and
disrupting them can stop some
cancers from proliferating. But
cancer cells quickly become
resistant to such inhibition,
so Thomas Kodadek of the
Scripps Research Institute
in Jupiter, Florida, and his
colleagues targeted a different
part of this system to try to
avoid resistance. They showed
an anti-cancer effect from
blocking a receptor called
Rpn13, which is present in
elevated amounts in cancer
cells but is not targeted by
traditional proteasome
inhibitors. The blocking
molecule binds selectively to
Rpn13 and is toxic to
multiple-myeloma cells.

Their work confirms an
earlier report that it is possible
to bind a drug-like compound
specifically to the Rpn13
receptor.

J. Am. Chem. Soc. http://doi.org/372 (2015)
Mummies’ stature
reveals inbreeding 


The heights of the mummified 
pharaohs who ruled ancient 
Egypt support the belief that 
they married their siblings. 
Historical records say that 
many Egyptian pharaohs 
married their sisters, but it
is hard to prove through 
genetic testing because 
of ethical objections to 
destroying mummies’ 
tissues (pictured is 
Rameses III, who 
was Pharaoh in 
1186–1155 BC).
Frank Rühli at
the University of
Zurich in Switzerland and his
colleagues used body height, 
which is heavily dependent on 
genetics, to look for evidence 
of inbreeding in 259 mummies 
of both commoners and 
royals. Pharaohs varied less in 
height than men in the general 
population, which suggests 
that royal Egyptians may 
have been more inbred than 
commoners, the authors 
say. Pharaohs also tended
to be taller than non-royal
men from the same
time period. Royal and
non-royal women were
equally variable in
height.
Am. J. Phys. Anthropol. http://doi.org/37x (2015)
Failing to see dead
wood for the trees 


Forests could be emitting 
more carbon than is believed, 
because they store large and 
previously unrecognized 
quantities of dead wood. 

Such wood provides 
resources for new growth but 
releases carbon as it decays. 
Marion Pfeifer at Imperial 
College London and her team 
examined the amount of dead 
wood in 193 plots measuring 
25 metres by 25 metres in 
Sabah, Malaysia. The plots 
ranged from pristine forest to 
areas that had been cleared for 
oil-palm plantations. 

The authors estimated that 
dead wood can hold just 5.4% 
of the carbon in some pristine 
forest, but may contain up to 
64% of the carbon at sites where 
selected trees had been logged. 
This value is higher than is 
often assumed in the literature, 
they say, suggesting that some 
estimates of the net amount of 
carbon trapped by living wood 
in forests may be too high. 
Environ. Res. Lett. 10, 044019 (2015)


SEISMOLOGY 


Crust crunch leads 
to huge quakes 


The world’s biggest earthquakes 
can happen in clusters if the 
geological geometry is right. 

Great quakes of
magnitude 8.5 and above occur
where one plate of Earth’s
crust dives beneath another.
If the width of plate overlap is
particularly wide — more than
around 120 kilometres — then
even a single huge earthquake
cannot relieve all the stress
that builds up between the
two plates, says a team led by
Robert Herrendörfer of ETH
Zurich in Switzerland.

The team’s modelling
suggests that only a series of
huge quakes, culminating in a
final big one, can overcome the
wide plate overlap and relieve
seismic stress. This ‘supercycle’
could explain larger-than-expected
quakes, such as the
2011 Tohoku event in Japan.
Supercycles might also occur in
regions that have not previously
been thought to be at risk, such
as Alaska and the Antilles.
Nature Geosci. http://dx.doi.org/10.1038/ngeo2427 (2015)


Instability drives 
abstract art 


Abstract patterns by Mexican 
artist David Alfaro Siqueiros 
are the result of a well-studied 
problem in fluid dynamics. 

Roberto Zenit at the National 
Autonomous University of 
Mexico in Mexico City and 
his team replicated the artist's 
‘accidental painting’ technique, 
in which layers of paint are 
poured on a horizontal surface, 
over time creating whorls, 
blobs and other shapes. The 
authors successfully produced 
patterns similar to those in 
Siqueiros's work using a dense 
fluid on top of a light one — an
unstable arrangement because 
the heavier liquid pushes 
through the light one, forming 
characteristic shapes. 

This ‘Rayleigh-Taylor 
instability’ between two fluids 
has already been widely studied 
because it is important in 
many natural and engineering 
systems. An experimental 
arrangement similar to that 
used by Siqueiros could help 
physicists to advance such 
studies, the authors say. 

PLoS ONE http://doi.org/10.1371/journal.pone.0126135 (2015)


Fossils show
extinction risk


Fossil records can suggest
which modern marine species
would be at risk of extinction in
the absence of human activity.
Seth Finnegan at the
University of California,
Berkeley, and his team used
fossils from the past 23 million
years to examine the extinction
risk of 2,897 genera from
6 groups — sharks, marine
mammals, scleractinian corals,
bivalve molluscs, echinoderms
such as sea urchins, and snails.
They found that small
geographic ranges consistently
increased extinction risk, and
that some broad taxonomic
groups had a consistently
higher extinction risk than
others. Mapping these two
features onto modern relatives
of these six groups shows
the distribution of baseline
extinction risk, with hotspots in
the Indo-Pacific and Western
Atlantic tropics. Overlaps
between this baseline and
human-derived pressures could
point to areas of particular
vulnerability.

Science 348, 567-570 (2015)


SOCIAL SELECTION


Popular articles
on social media


Sexist review causes Twitter storm


Scientists on social media have reacted with dismay to a peer
reviewer who suggested that a rejected study — which looked
at gender differences in postdoctoral job opportunities —
could be improved by adding a male co-author. Fiona Ingleby,
lead author of the still-unpublished study and an evolutionary
biologist at the University of Sussex, UK, revealed excerpts
of the anonymous review on Twitter on 29 April, setting off
a flurry of retweets and responses. “Just in case you thought
blatant sexism in academia is gone, look at recent tweets from
@FionaIngleby,” tweeted Seth Zenz, a particle physicist at
Imperial College London. Caitlin Casey,
an astronomer at the University of
California, Irvine, tweeted: “And this is
one of the reasons we need double-blind
refereeing.”

For more on popular papers, see go.nature.com/ekdfay


IceCube gives
neutrinos flavour


High-speed neutrinos detected
by the IceCube observatory at
the South Pole do come from
distant cosmic sources such as
galactic nuclei, after all.

Neutrinos are subatomic
particles that come in three
‘flavours’ — electron, muon
and tau — which change as
they travel over long distances.
Previous IceCube data
suggested that few muon or
tau types had been detected in
2010-12, indicating a possible
non-cosmic source or exotic
new physics affecting how
neutrinos oscillate.

Gary Binder of the
University of California,
Berkeley, and his colleagues
identified 137 high-energy
neutrinos in this date range.
They ruled out mainly electron
and mainly muon scenarios,
concluding that the ratio of
flavours is consistent with 1:1:1.
This is the expected ratio from
neutrinos oscillating over long
distances, and further confirms
that they originated from
distant astrophysical objects.

A separate study by
Francesco Vissani at the
Gran Sasso Science Institute
in L’Aquila, Italy, and his
colleagues looked at higher-energy
neutrinos detected by
IceCube (pictured is one of
the sensors in the ice) and also
concluded that these are likely
to be from cosmic sources.
Phys. Rev. Lett. 114, 171102;
171101 (2015)
UK to head SKA


The Square Kilometre Array
(SKA) will be headquartered at
the Jodrell Bank Observatory
near Manchester, UK, the
SKA organization announced
on 30 April. Once completed
in the 2020s, SKA will
be the world’s largest radio
observatory — a network of
more than 100,000 antennas
shared between Australia
and South Africa that will
study phenomena of the
early Universe. In reaching
its decision, the organization
rejected the recommendation
of an expert panel in
March that the site for the
headquarters should be
Padua in Italy (see Nature
http://doi.org/38h; 2015). A
Nature Editorial criticized the
organization’s hesitation in
following the panel’s advice
(see Nature 519, 129; 2015).


Institute renamed 
The premier US advisory 
board for medical research 
has voted to change its name. 
On 28 April, the Institute of 
Medicine in Washington DC 
announced that it is becoming 
the National Academy of 
Medicine. The change aligns 
the group’s name with those 
of its sister organizations, the 
National Academy of Sciences 
and the National Academy of 
Engineering, and will become 
official on 1 July. 


MESSENGER ends 


In a spectacular mission-
ending crash, NASA's
MESSENGER spacecraft
hit Mercury on 30 April as
planned. The probe ran out of
fuel and disappeared on the far
side of the planet, as seen from
Earth. MESSENGER began
orbiting Mercury in 2011, and
the mission lasted far beyond
its expected lifetime of one
year. The probe's discoveries
include ice at Mercury's
poles and puzzlingly high
amounts of volatile elements
such as sulfur and chlorine.
The next Mercury mission,
the joint European-Japanese
BepiColombo, will launch
in 2017. See go.nature.com/eetga2
for more.


The news in brief


Vatican gathers leaders for climate talks


Religious leaders, scientists and policy-
makers met at the Vatican on 28 April
to discuss climate change. Although the
Pontifical Academy of Sciences has held
climate-related conferences at the Vatican
before, this was the first to include the leaders
of various religions. Pope Francis is preparing
an ‘encyclical’ letter on climate change to
send to bishops this summer ahead of the
United Nations climate talks in Paris this
December. Delegates at the meeting included
UN secretary-general Ban Ki-moon, Italy’s
President Sergio Mattarella and President
Rafael Correa of Ecuador.


Rubella eliminated
The World Health
Organization (WHO)
declared on 29 April that
the Americas is the first of
its regions to have formally
eliminated rubella. The
WHO's Global Vaccine Action
Plan calls for elimination of
the disease in two of the six
WHO regions by 2015, and
in five by 2020. Elimination
can be fragile, however;
measles was eradicated from
the Americas in 2002 but has
resurged in recent years, with
imported cases spreading in
unvaccinated populations.
And with the global level of
rubella vaccination coverage
at just 40%, and that of several
other diseases also low, the
WHO last month warned that
global vaccination targets are
“far off track”.


NASA balloon leak
NASA's latest super-pressure
balloon fell to Earth on 27 April
after developing a leak 32 days
into its flight circling around
southern latitudes from New
Zealand to Australia. It was the
longest flight yet in which the
balloon endured day-night
cycles that challenge its ability
to stay aloft. The mission had
aimed to beat the record for the
longest super-pressure-balloon
flight of 54 days. That flight,
however, was in the constant
daylight of the summer
Antarctic. NASA is developing
the balloons as a way to loft
scientific payloads to the edge
of space.


Supply-craft failure 
An unmanned Russian 
spacecraft that was due
to deliver supplies to the
International Space Station 
(ISS) tumbled out of control 
on 28 April shortly after its 
launch from the Baikonur 
cosmodrome in Kazakhstan. 
Russian officials said that 
the failed Progress M-27M
spacecraft may stay in an
uncontrollable orbit for up to 
two weeks before it re-enters 
the atmosphere. Where debris 
might crash to Earth cannot 
be predicted, but officials say 
that the risk of any hitting 
populated areas is small. The 
ISS crew has sufficient food 
and water left until the next 
planned delivery on 19 June. 


Blast off for Bezos 
Blue Origin, the private 
rocketry company founded 
by Amazon.com's Jeff
Bezos and based in Kent,
Washington, conducted the 
first successful launch of its 
spaceship on 29 April. The 
New Shepard vehicle, named 
after the first American in 
space, Alan Shepard, flew
up more than 93 kilometres
from the company’s launch 
site in Texas. One portion, the 
propulsion module, did not 
return to Earth as planned, 
but the company did not 
explain what happened. Blue 
Origin aims to fly tourists and 
research payloads to the edge 
of space on reusable rockets. 


Space icon goes 3D 
The 3D structure of the star- 
forming clouds known as the 
Pillars of Creation (pictured) 
was revealed on 30 April — 
along with news that the iconic 
object's days are numbered. 
The columns, imaged by the
Hubble Space Telescope in
1995, lie within the Eagle 
Nebula some 2,100 parsecs 
(7,000 light years) away. 
Astronomers created the
3D images using the MUSE
instrument on the European 
Southern Observatory’s Very 
Large Telescope in Chile. The 
structures are pummelled by 
intense ultraviolet radiation 
and stellar winds from the 
nearby star cluster NGC 6611, 
which, in spite of forming the
pillars, will also eventually
erode them completely in just 
3 million years. 


Brain-scan analysis 
The Human Connectome 
Project (HCP), a US$40-million 
effort to map connections in 
the human brain, has released 
findings that show how 
behavioural and demographic 
factors correlate with brain 
activity. The HCP, which is 
funded by the US National 
Institutes of Health, revealed 
‘MegaTrawl’ on 29 April. This
database links brain scans
from roughly 460 people with
187 variables, such as age,
sex, socio-economic status
and smoking habits. HCP
co-chair Stephen Smith at the
University of Oxford, UK,
says that the data will serve as
a tool for researchers to find
meaningful differences in
brain connections between
demographic groups.


FOLLOW THE MONEY


TREND WATCH


UK universities spent £29.4 billion
(US$44.5 billion) in 2013-14,
according to data released by
the Higher Education Statistics
Agency on 29 April. Teaching
costs accounted for almost 39%
of the total, with medicine,
dentistry and health departments
drawing the highest amount
(9%). Research grants cost 14.3%.
Administrative services took 16%,
including management salaries,
public relations and student
services. Academic services such
as libraries and computing added
another £2.5 billion (8.4%).


How UK universities spent their cash


Medicine, dentistry and health
Biology, maths and physical sciences
Social studies
Engineering and technology
Business
Humanities
Creative arts
Education
Architecture
Agriculture and veterinary science
Administrative services
Research grants
Premises
Academic services
Other
Residences


Climate pledge 


California has committed to 
reducing its greenhouse-gas 
emissions to 40% below 1990 
levels by 2030, matching the 
European Union's pledge
as officials prepare for the
climate summit in Paris
this December. Governor
Edmund Brown Jr signed an 
executive order on 29 April 
that extends the state’s current 
commitment — enacted in 
2006 — to reduce emissions to 
1990 levels by 2020. California's 
long-term goal is to reach 80% 
below 1990 levels by 2050. 


Biofuels cap 


Europe has agreed to put a 
brake on biofuels made from 
food crops. By 2020, such 
fuels should provide no more 
than 7% of transport fuel, the 
European Parliament voted 
on 28 April — concluding 
years of wrangling. In 2012, 
the European Commission 
proposed a limit of 5%,
but lobbying from the
biodiesel industry and some
member states prompted the
commission to loosen the
reins (see Nature 499, 13-14;
2013). Scientists have long
warned that fuels such as
biodiesel made from palm
oil can produce more carbon
emissions than the fossil fuels
they replace.


SEVEN DAYS | THIS WEEK


8-12 MAY
The American
Association of
Immunologists holds
its annual meeting in
New Orleans, Louisiana.
Topics include cancer
immunotherapy and the
role of the microbiome
in immunity.
go.nature.com/6hodsw


11-15 MAY
International experts on
underwater acoustics
will gather in Barcelona,
Spain, to discuss issues
around ocean noise.
oceanoise2015.com


BUSINESS
Tesla charges on 


Electric-car maker Tesla 
Motors in Palo Alto, 
California, unveiled batteries 
for residential, commercial 
and utility-company use on 
30 April. Based on tried- 
and-tested lithium-ion cells, 
the batteries are designed
to smooth out the use of
solar energy and will be 
available to consumers this 
summer. Experts say that 
energy-storage facilities will 
help to ease the transition to 
the low-carbon electricity 
required to combat climate 
change, but warn that 
battery technology is still too 
expensive for storing large 
amounts of solar power. See 
go.nature.com/umykel for 
more. 


DUAL-USE TECHNOLOGY


Japan’s biggest warship since the Second World War was delivered to the navy at the United Marine shipyard in Yokohama in March. 


Japanese academics fear 
military incursion 


Relationship between traditionally pacifist research community and military is changing. 


BY DAVID CYRANOSKI 


Ever since the Second World War, Japan
has been resolutely pacifist. So some
scientists were perturbed last month to
learn that the country’s defence ministry has
started a basic-research fund. Although the sum
— ¥300 million (US$2.5 million) for the first
year — is a pittance compared with the nation’s
total research funding, to much of the academic
community the money is an ominous sign.
The fund, which appeared in the country’s
annual budget, is the latest in a slew of develop-
ments that suggest that Japanese research is
taking on a military dimension, say a growing
number of concerned scientists. Also of con-
cern is a recently introduced government
project inspired by the US Defense Advanced
Research Projects Agency (DARPA).

“Military objectives and military logic could 
start to have a major influence on the academic 
world,” says Morihisa Hamada, a geochemist
at the Japan Agency for Marine-Earth Science 
and Technology in Yokohama, who started an 
online campaign decrying the shift last year. 
“To us, this is of particular concern.” 

In 1950, the Science Council of Japan made a 
statement eschewing research tied to war, and
such vows have since been renewed. But with
a hawkish prime minister in Shinzo Abe, and
tensions with China and North Korea growing,
the academia–military relationship is changing.

Debate over the cultivation of research 
with military applications started to heat up 
last year, when the Abe-chaired Council for 
Science, Technology and Innovation created 
a project dubbed ImPACT (Impulsing Para- 
digm Change through Disruptive Technolo- 
gies). The ¥550-billion, 5-year project consists 
of 12 research programmes that the govern- 
ment says were picked because they are high 
risk but offer high commercial rewards. The
government says that this focus, and the
project's management structure — each 
programme has its own manager, who coor- 
dinates research between mostly university- 
based scientists — were inspired by DARPA, 
a powerhouse for blue-skies research. 

Critics have charged that ImPACT also 
mimics DARPA's explicit focus on ‘dual-use’
technologies with both military and civilian 
applications, and there are signs that it could 
have a military dimension. The military is 
“watching the ImPACT projects closely”,
says a defence-ministry representative who 
did not want to be named. And Atsushi 
Sunami, a researcher at the National Gradu- 
ate Institute for Policy Studies in Tokyo who 
helped to create ImPACT, says that although 
Abe's main goals were economic, he was also 
drawn to the potential military applications 
“because of the changing security environ- 
ment surrounding Japan, including the 
rising tensions with China”.

However, some researchers whose pro- 
jects have dual-use potential say that they 
feel little pressure to devise military uses. 

Satoshi Tadokoro of Tohoku University 
will get ¥3.5 billion for a ‘tough robotics’ pro- 
ject developing aerial and legged automatons 
to withstand heat and explosions. He says 
that his project has no connection with the 
military and will be used for disaster relief. 

Similarly, Yoshiyuki Sankai at the Uni- 
versity of Tsukuba and his team are using 
ImPACT funding to develop an exoskeleton 
that picks up nerve signals from its wearer 
and converts them to mechanical force. It 
will be used in health care, to help caregivers 
who have to lift patients, says Sankai. 

The ministry of defence’s ¥300-million 
fund is much less ambitious than ImPACT. 
The money will go to a competitive grant 
programme similar to those run by the 
health and economics ministries. But it will 
be explicitly earmarked for dual-use tech- 
nologies, including ones that create military 
equipment or apply cutting-edge technology 
to defence. 

“In terms of the government strategy of 
asking universities and research institutes 
to do military research, this is a major turn-
ing point,” says Norikazu Kameyama, an
agricultural scientist at the University of 
the Ryukyus in Okinawa. 

University budgets have been cut over 
the past decade, and scientists have become 
increasingly reliant on ministry grants. 
Hamada says that some researchers are 
describing the defence grants as a “bless-
ing”. But he fears that the funding will come
with restrictions on the ability to share data. 

His is not a lone voice. He set up his online
appeal with Kameyama in March last year, in 
response to earlier signs that the military was 
making incursions into research. It now has 
1,000 signatures. “We are appealing to the 
conscience of scientists,” says Kameyama.


14 | NATURE | VOL 521 | 7 MAY 2015 


The New Horizons probe (artist’s impression) will be the first to visit Pluto and its moon Charon close up. 


Pluto mission
hunts for hazards


Unknown moons pose a danger to the New Horizons craft 
as it journeys to the edge of the Solar System. 


BY ALEXANDRA WITZE 


On 11 May, a telescope aboard a NASA
spacecraft will turn and stare at Pluto — like a
space-robot equivalent of a sailor watch-
ing for shoals that could take out his vessel.

As the New Horizons craft hurtles towards 
its 14 July rendezvous with Pluto, mission 
scientists are preparing to guide it through 
what could be a dangerous dusty realm. It 
will skim the orbit of the dwarf planet’s largest 
moon, Charon, at more than 48,000 kilometres 
per hour. At that speed, hitting a dust particle 
the size of a rice grain could be fatal.

To minimize the risk of hitting debris from 
Charon or another, unknown moon, scientists 
will conduct seven 45-minute observation ses- 
sions between 11 May and 1 July. If they find 
a potential hazard, the team can change the 
spacecraft’s course. That would mean forgoing 
some science observations, but it could also 
mean the difference between life and death for 
the first spacecraft ever to visit Pluto. 

Mission leaders calculate that the chance 
of a dust particle knocking out New Horizons 
is just 1 in 10,000. The only way to avoid the 
hazard completely would be to stay away from 
Pluto altogether, says Marc Buie, a mission
co-investigator at the Southwest Research
Institute in Boulder, Colorado. 

New Horizons, now roughly 90 million kilo- 
metres from Pluto, is giving humans their best- 
ever view of the dwarf planet. The spacecraft 
carries the Long Range Reconnaissance Imager, 
a giant zoom lens with a 20.8-centimetre aper- 
ture. During the hazard observations, the 
instrument will scan the space around Pluto 
and its five known moons, looking for objects 
as small as a few kilometres across. 

The mission is almost certain to discover 
new moons in the process. The Hubble Space 
Telescope found two during its hazard searches 
before the Pluto mission: Kerberos, which 
measures 14-40 kilometres across, in 2011, 
and the smaller Styx in 2012. “I wouldn't be 
at all surprised if there were more,” says Cathy
Olkin, a New Horizons deputy project scientist 
at the Southwest Research Institute. 

Charon, which is roughly the same width as 
Texas, is thought to have formed during a giant 
impact early in Pluto's history. The other four
known moons are much smaller and may be 
debris spit out during the same collision. “The 
architectural details of this system are fasci- 
nating,” says William McKinnon, a planetary 
scientist at Washington University in St. Louis,
Missouri. “It’s like a miniature planetary system.”

New-found moons might appear in any of 
a number of places around Pluto, says Buie. 
There is, for example, a pocket that nestles 
inside the orbit of Charon that is gravitation- 
ally stable enough for a moon to survive there 
for millions of years. But planetary scientists 
do not have a plausible explanation for how 
a moon might have appeared there to begin 
with, Buie says. 

Another place that could have small moons 
is outside the orbit of Hydra, Pluto’s outermost 
known moon. Buie hopes to find an exotic 
arrangement: moons sharing the same orbit. 
Any such discovery would rewrite the origin 
story of the Pluto-Charon system, he says. 

As New Horizons gets closer to Pluto, it 
may even spot rings of dust around the dwarf 
planet. These could form when meteorites hit 
the small moons, kicking up dust that drifts 
away from the moons’ low-gravity environ- 
ments and into orbit around Pluto. 

It is this streaming dust that poses the big- 
gest danger to New Horizons. Mission scien- 
tists have been modelling where they think the 
dust from Pluto’s moons might flow, and will do 
so again after the upcoming hazard searches. 

If mission planners do spot something dan- 
gerous, they can divert New Horizons onto one 
of three alternative routes (see ‘Perilous path’). 


PERILOUS PATH 


Unknown moons, debris or dust could interfere 
with the New Horizons spacecraft’s journey to Pluto.
Mission scientists are scanning for such hazards 
now, and can divert the probe onto one of three 
alternative paths if needed. 


One option would leave the spacecraft on its 
current course but rotate it so that its antenna 
acts as a shield against incoming debris. 
Another possibility would send the craft div- 
ing much closer to Pluto, inside the orbit of 
Charon — assuming that there is no extra 
moon there. A third would nudge the space- 
craft slightly farther away from Pluto. 

The last chance to divert to one of these 
courses comes on 4 July. Choosing any of them
would burn some of the fuel that engineers
need to steer the spacecraft when it departs 
Pluto for a planned visit to a second icy world 
in the outer Solar System's Kuiper belt. 

Even so, mission planners have a backup 
for this backup. They will command New 
Horizons to radio a little of its crucial data 
back to Earth in the two days before its Pluto 
encounter — just in case something takes the 
spacecraft out.


BIOTECHNOLOGY 


Pint-sized DNA sequencer 
impresses first users 


Portable device offers on-the-spot data to fight disease, catalogue species and more. 


BY ERIKA CHECK HAYDEN 


In April, Joshua Quick boarded a plane
to Guinea with three genetic sequenc-
ers packed in his luggage. That fact alone
is astonishing: most sequencing machines
are much too heavy and delicate to travel as
checked baggage in the hold of a commercial
airliner. What came next was even more
impressive. For 12 days, Quick used these
sequencers — called MinIONs — to read the
genomes of Ebola viruses from 14 patients in as
little as 48 hours after samples were collected.
That turnaround has never been available to
epidemiologists in the field before, and could
help them to trace sources of infection as they
try to stamp out the West Africa epidemic.
The European Mobile Laboratory Project,
based in Hamburg, Germany, is building a
dedicated MinION lab at a treatment centre
in Coyah, Guinea, where the machines will
be used to sequence patient DNA.

“This is democratization of sequencing,” 
says Quick, who studies Ebola genomes with 
Nicholas Loman (both are bioinformaticians 
at the University of Birmingham, UK). “You 
don't have to rely on expensive infrastructure
and costly equipment.”

Quick and Loman’s research exemplifies 
what excites biologists about the MinION, a 
palm-sized gene sequencer made by UK-based 
Oxford Nanopore Technologies. The device is 
portable and cheap. It can read out relatively 
long stretches of genetic sequence, an ability 
increasingly in demand for understanding 
complex regions of genomes. And it plugs into 
the USB port of a laptop, displaying data on the
screen as they are generated, rather than at the
end of a run that can take days.

“The MinION is pretty extraordinary,” 
says David Deamer, a biochemist at the 
University of California, Santa Cruz. “We're all
on kind of a high right now.”

The MinION was first released last spring 
through an early-access programme that 
offered researchers the device and flow 
cells — the disposable working guts of the 
sequencer — for a US$1,000 deposit. At a 
meeting in London on 14 and 15 May, users 
will share their experiences of testing the 
device and writing programs to analyse the 
data it generates. For example, Deamer is using 
it to detect and study the kinds of nucleic acids 
that might have given rise to Earth’s earliest life. 

The pilot programme has helped Oxford 
Nanopore to recover from a recent gaffe. It 
promised in 2012 that in 2013 it would release 
not just the MinION but also other machines 
able to sequence a human genome in 15 min- 
utes. Those superfast machines have not yet 
come to pass, but in February 2014 the com- 
pany rolled out the MinION. Initial tests 
suggested that it was not nearly as fast as
had been promised and made a worrying
number of errors. But Oxford Nanopore has 
made improvements; recent publications by 
pilot users have shown that a MinION can 
do quite a lot. It can reliably sequence small 
genomes, such as those of bacteria and yeast. 
It can discriminate between closely related 
bacteria and viruses, read complex portions of 
the human genome, and differentiate between 
the two versions of a gene that are carried on 
each chromosome pair. 


SEQUENCING ON THE FLY 

Oxford Nanopore’s progress has some biolo- 
gists imagining new ways to use sequencing 
in the field. On 4 May, Italian biologists toted 
MinIONs and other equipment into a rain- 
forest in southern Tanzania to sequence the 
genome of a native frog. Karen James at the 
MDI Biological Laboratory in Bar Harbor, 
Maine, wants to put it in the hands of citizen 
scientists to survey biodiversity in Maine’s 
Acadia National Park. If Oxford could mod- 
ify the MinION to run on an iPhone, an app 
written by computational biologist Michael 
Schatz at Cold Spring Harbor Laboratory in 
New York might enable on-the-spot identifi- 
cation of unknown organisms using just the 
two hand-held devices. NASA scientists even 
plan to send a MinION to the International 
Space Station, where astronauts would test it 
in microgravity. If that goes well, astrobiologist 
Aaron Burton at the Johnson Space Center in 
Houston, Texas, suggests that a hand-held 
sequencer might be sent to Mars to look for 
molecular signs of life on the red planet. 

Yet there is still plenty that the MinION 
cannot do. It is not practical to sequence large 
genomes with the device: Schatz estimates that 
it would take a year for the current version to 
sequence the equivalent of a human genome. 
And the machine still misidentifies DNA 
sequence 5-30% of the time — a high error 
rate compared with those of existing full-sized 
sequencers. It also has trouble reading long, 
repetitive regions of sequence. 

Improvements are coming: the MinION 
reads each genetic base many times, and bio- 
informaticians in the pilot programme have 
developed error-correcting methods to com- 
bine data from these reads into results that 
have a much lower overall error rate. 
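
The idea of combining repeated reads can be illustrated with a minimal sketch. This is not the method used by Oxford Nanopore or by the pilot users, which the article does not describe; it is a simple majority vote that assumes the reads are already aligned and of equal length, and the function name and example sequences are made up for illustration.

```python
from collections import Counter


def consensus(reads):
    """Majority-vote consensus over pre-aligned reads of equal length.

    Hypothetical helper for illustration only; real pipelines must also
    align the reads and handle insertions and deletions.
    """
    if not reads:
        raise ValueError("need at least one read")
    length = len(reads[0])
    if any(len(read) != length for read in reads):
        raise ValueError("reads must be pre-aligned to the same length")
    # For each position, keep the base that appears most often across reads.
    return "".join(
        Counter(column).most_common(1)[0][0] for column in zip(*reads)
    )


# Five noisy reads of the same made-up sequence; four contain one error each.
reads = ["ACGTACGA", "ACCTACGT", "ACGTACGT", "TCGTACGT", "ACGTAGGT"]
print(consensus(reads))
```

Because the errors in this toy example fall at different positions, each one is outvoted by the other reads, so the combined result has fewer errors than a typical single read.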

Biologists want more from the MinION — 
faster and more accurate performance, and 
the ability to simply drop a sample onto the 
sequencer with no preparation. Users hope that, 
at the London meeting, the company will unveil 
improvements in the speed and performance of 
the technology that it has hinted at on Twitter 
over the past month. The commercial price is 
also a big unknown; low cost is a huge part of the 
device's appeal, but Oxford will need to prove 
to its investors that they can still make money 
selling devices cheaply. If, as Deamer says, biolo- 
gists are on a high right now, the company still 
has much to do to keep the buzz from fading.


Fossil-fuel energy is a profitable business, but institutions are being urged not to invest in it.


Reality check for 
fossil-fuel divestment 


Academics urge other ways to reduce carbon emissions. 


BY JEFF TOLLEFSON 


The fossil-fuel divestment movement has
divided academia. On 24 April, SOAS
University of London announced that
it would sell off all of its investments in fossil- 
fuel companies within three years. Days later, 
the New York University (NYU) senate voted 
to recommend that the university keep its 
current fossil-fuel investments, but to develop 
a greener strategy for the future. Other institu- 
tions, including Harvard University in Cam- 
bridge, Massachusetts, and most recently 
Swarthmore College in Pennsylvania, have 
rejected calls for divestment. 

Even if the movement does convince 
institutions to pull billions of dollars out of 
fossil fuels, economists generally agree that 
divestment will have little impact on the 
energy industry or greenhouse-gas emis- 
sions. Supporters themselves acknowledge 
that it is mostly a symbolic gesture. But some 
academics fear that the movement could 
draw attention away from more effective 
ways to encourage low-carbon energy. 

“We need to focus on actions that are going 
to make a real difference,” says Rob Stavins,
an economist at Harvard.

Students and environmentalists launched 
the divestment movement a few years ago, 
targeting major institutional investors such 
as public pension funds, government-held 
investment funds and philanthropic founda- 
tions. Organizers say that nearly 30 universi- 
ties worldwide have made commitments to 
divest, or to otherwise revise their investment 
strategies. But investment experts say that 
divestment is unlikely to make a difference. 
“We don't believe in divestment as a strategy 
— it’s naive, and it doesn't solve the problem,”
says Julian Poulter, chief executive of the Asset 
Owners Disclosure Project in London, which 
works with institutions to build greener invest- 
ment portfolios to reduce risk as governments 
impose emissions-reducing regulations. 

At NYU, faculty members, students and 
administrators in its 129-person senate 
adopted a resolution suggesting that the board 
of trustees maintain fossil-fuel investments 
totalling roughly US$139.7 million — around 
4.1% of the $3.4-billion endowment. However, 
the resolution calls on the university not to put 
further money into fossil-fuel companies, and 
instead to develop a plan to invest in firms that
“demonstrate a commitment to mitigate the
effects of burning fossil fuels”. 

“I'm happy with this outcome; it’s a sign of
progress,” says David Frank, an environmental
ethicist at NYU. He signed a petition in sup- 
port of divestment, and says that the campaign 
is raising awareness about the problem, even if 
it does not provide any specific solutions. 

Others complain that divestment misses the 
central role of fossil fuels in modern life. “How 
can you vilify the consumption of something 
that you use every day?” says Frank Wolak, an 
economist at Stanford University in California. 

He and others hope to shift students’ atten- 
tion to what they consider a more effective strat- 
egy for weaning universities — and the world 
— off fossil fuels. On 20 April, Yale University in
New Haven, Connecticut, announced plans to
develop an internal carbon tax to reduce emis- 
sions. Wolak has proposed a similar approach 
at Stanford, and his team aims to develop a net- 
work of universities dedicated to the idea. 

Such a network will not have a major impact 
on global carbon dioxide emissions, but econ- 
omists have called for the same approach on 
a global scale. If fossil-fuel emissions were 
assigned a fee on the basis of their environ- 
mental impact, then dirty energy would 
become more expensive than cleaner alterna- 
tives. That would naturally shift investment 
toward cleaner sources. 

At Yale, the carbon fee would be revenue 
neutral, similar to national proposals from think 
tanks in Washington DC. Money would be
transferred from departments that have higher
greenhouse-gas emissions than the university 
average to those that perform better. Students 
could participate by helping to cut emissions. 
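
A revenue-neutral transfer of this kind can be sketched in a few lines. The article gives no formula, so the one below is an assumption: each department pays or receives a fee proportional to how far its emissions sit from the university average. The department names, tonnages and fee level are invented for illustration.

```python
def revenue_neutral_transfers(emissions, fee_per_tonne):
    """Return each department's net payment under an assumed fee scheme.

    emissions: dict mapping department name to emissions in tonnes of CO2.
    A positive result means the department pays; a negative one means it
    receives money. Payments sum to zero, so the scheme raises no revenue.
    """
    if not emissions:
        raise ValueError("need at least one department")
    average = sum(emissions.values()) / len(emissions)
    return {
        dept: fee_per_tonne * (tonnes - average)
        for dept, tonnes in emissions.items()
    }


# Hypothetical figures: three departments and a fee of 20 per tonne.
example = {"chemistry": 120.0, "history": 40.0, "physics": 80.0}
transfers = revenue_neutral_transfers(example, fee_per_tonne=20.0)
for dept, amount in transfers.items():
    print(dept, amount)
```

In this made-up case the above-average emitter pays exactly what the below-average emitter receives, and a department at the average pays nothing.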

The idea is to give students and faculty 
members experience that they can carry into 
the real world, says William Nordhaus, an 
economist at Yale who led the development 
of the proposal. Nordhaus expects the univer- 
sity to start a pilot programme in the coming 
academic year, and to expand to all operations 
within three years. 

“If you want to do something about climate 
change, then you have to do something about 
prices,” he says. “You are not going to solve 
the problem by beating up on companies.”
SEE EDITORIAL P.6


PARTICLE PHYSICS


Mysterious galactic signal 
points LHC to dark matter 


High-energy particles at centre of Milky Way now within scope of Large Hadron Collider. 


BY DAVIDE CASTELVECCHI 


It is one of the most disputed observations
in physics. But an explanation may be in
sight for a mysterious excess of high-energy
photons at the centre of the Milky Way. The
latest analysis suggests that the signal could
come from a dark-matter particle that has just
the right mass to show up at the world’s largest
particle accelerator.

The Large Hadron Collider (LHC), housed
at the CERN particle-physics laboratory near
Geneva, Switzerland, is due to restart colliding
protons this summer after a two-year hiatus.
Physicists there have told Nature that they now
plan to make the search for such a particle a top
target for the collider’s second run.

A positive detection would resolve the source
of the galactic γ-rays. But it would also reveal the
nature of dark matter, the invisible stuff thought
to make up around 85% of the Universe’s
matter, and would be long-sought evidence for
supersymmetry, a grand way to extend the cur-
rent standard model of particle physics.

“This could very well be the single most 
promising explanation for the Galactic Centre 
proposed to date,” says Dan Hooper of the Fermi
National Accelerator Laboratory (Fermilab) in
Batavia, Illinois, although he adds that “there are
quite a few others that are not too far behind”.

In 2009, Hooper and Lisa Goodenough, then 
a graduate student at New York University, were 
the first to spot the signal, in data from NASA’s
Fermi Gamma-Ray Space Telescope. They pro- 
posed that the bump was a signature of dark 
matter. Two colliding dark-matter particles 
would annihilate each other, just as ordinary 
matter does with antimatter. The annihilation 
would generate a succession of short-lived par-
ticles that would eventually produce γ-rays.

But the proposed particle, which has been 
dubbed the hooperon or gooperon after its 


proponents, soon ran into problems with 
physicists’ favourite version of supersymmetry. 
Although the minimal supersymmetric standard 
model (MSSM) allows for dark-matter particles 
with the estimated mass of hooperons — about 
25–30 gigaelectronvolts (1 GeV is roughly the
mass of a proton) — multiple experiments had 
suggested that the particles must be heavier. 
To accommodate hooperons, MSSM would 
have to be modified to an extent that makes 
many physicists uncomfortable. “It would have 
required a completely new theory,” says Sascha
Caron, a particle physicist at Radboud Univer- 
sity Nijmegen in the Netherlands, who leads the 
team behind the latest calculations. 

Sceptics suggested that the γ-ray excess
spotted in the Fermi data had more-mundane 
explanations, such as emissions from neutron 
stars or from the remnants of exploded stars. 

But in late 2014, it emerged that calculations
for the range of dark-matter-particle masses
that would be compatible with the Fermi
bump were too conservative. Fresh estimates of
the γ-ray ‘noise’ produced by known sources,
provided by the Fermi science team and others,
allow for much heavier particles. “The
excess can be explained with a particle of up to
200 GeV,” says Simona Murgia, a physicist at the
University of California, Irvine, and a leading
scientist in the Fermi team.


BIG-BANG FIT 
Armed with this insight, Caron and his 
collaborators recalculated the predictions of 
the MSSM theory and found another potential 
explanation for the excess — an existing dark- 
matter candidate called a neutralino. The neu- 
tralino was heavy enough not to be excluded by 
previous experiments, yet light enough to poten- 
tially be produced in the second run of the LHC. 
Caron’s model also produces a prediction 
for the amount of dark matter that should have 
been created in the Big Bang that is compatible
with state-of-the-art observations of the cosmic
microwave background — the relic radiation 
of the Big Bang — performed by the European 
Space Agency’s Planck probe (see Nature
http://doi.org/38k; 2014). This cannot be a
coincidence, he says. “I find […]” Caron’s team
is not the only one reanalysing the Fermi
bump in light of the new estimates. Similar
but less-detailed calculations done by
Fermilab physicist Patrick Fox and his colleagues
last November also revealed the neutralino
as a potential cause of the Fermi γ-rays.
And Katherine Freese, director of Nordita, the
Nordic Institute for Theoretical Physics in
Stockholm, says that she and her collaborators
have calculated that the excess could be
caused by a type of dark matter that features
in a less-popular theory of supersymmetry.

Resolution may be just around the corner. 
In addition to being produced at the LHC, the 
neutralino could also be within the shooting 
range of next-generation underground exper- 
iments that are trying to catch dark-matter 
particles that happen to fly through Earth, 
says physicist Albert De Roeck, who works 
on the CMS, one of the two LHC detectors 
that will hunt for dark matter. If such a particle
is indeed the cause of the γ-rays, he says,
“it seems that the dark-matter signals should
be observed very soon now”.
PALAEOGENETICS


Mammoth genomes hold 
recipe for Arctic elephants 


Catalogue of genetic differences reveals how ice-age giants braved the cold. 


BY EWEN CALLAWAY 


Woolly mammoths, unlike their
elephant cousins, were creatures
of the cold, with long hairy coats,
thick layers of fat and small ears that kept heat
loss to a minimum. Now, for the first time,
scientists have comprehensively catalogued
the hundreds of genetic variations that gave
rise to these differences.

The research reveals how woolly mam- 
moths (Mammuthus primigenius) evolved 
from the ancestor they share with Asian 
elephants (Elephas maximus; see ‘Mammoth 
divergence’). It could even serve as a recipe for
engineering elephants to live in Siberia. “These 
are genes we would need to alter in an elephant 
genome to create an animal that was mostly 
an elephant, but actually able to survive some- 
where cold,” says Beth Shapiro, an evolution- 
ary geneticist at the University of California, 
Santa Cruz, who was not involved in the lat- 
est research. As fanciful as it sounds, such an 
effort is at a very early stage in a research lab in
Boston, Massachusetts. 

The first woolly mammoth genome was
published in 2008, but contained too many 
errors to reliably pinpoint how it differs from 


the elephant genome. Other studies singled out 
variations in mammoth genes that would have 
led to light-coloured coats and oxygen-carrying
haemoglobin proteins that work in the cold.
In the latest study, a team led by Vincent
Lynch, an evolutionary geneticist at the University
of Chicago in Illinois, describe how
they sequenced the genomes of three Asian
elephants and two woolly mammoths (one
died 20,000 years ago, another 60,000 years
ago). They found that about 1.4 million DNA
letters differ between mammoths and elephants,
which altered the sequence of more
than 1,600 protein-coding genes. The study
was posted on the biology preprint server
bioRxiv.org on 23 April.

Woolly mammoths and elephants are
descended from a common ancestor.

A search for what those proteins do in other 
organisms revealed genes that could be relevant 
to life in the Arctic. Several were involved in 
setting the circadian clock, a potential adapta- 
tion to living in a world with dark winters and 
24 hours of daylight in summer. The mammoth 
genomes also contained extra copies of a gene 
that controls the production of fat cells, and 
some of the genes that differ between the two 
species are involved in sensing heat and trans- 
mitting that information to the brain. 

In the case of one heat-sensing gene, which 
encodes a skin protein called TRPV3 that also 
regulates hair growth, the team ‘resurrected’ 
the mammoth version. They inserted the gene 
sequence into human cells in the lab, which 
then made the protein. Exposing the mammoth 
TRPV3 to different temperatures revealed that it 
is less responsive to heat than the elephant ver- 
sion is. The next step, says Lynch, is to insert 
the same gene into elephant cells that have 
been programmed to behave like embryonic
cells, and so can be turned into various cell
types. These could then be used to examine
how mammoth proteins work in different tissues.
Lynch's team also plans to test the effects
of other mammoth variations in this way.


MAMMOTH TASK 

Similar work is already under way in the lab of 
geneticist George Church at Harvard Medical 
School in Boston. Using a technology known as 
CRISPR/Cas9 that allows genes to be edited, his 
team claims to have engineered elephant cells 
that contain the mammoth version of 14 genes 
potentially involved in cold tolerance. 

The work, says Church, is a preamble to 
potentially editing into the elephant genome 
all the variations that differentiate it from the 
woolly mammoth genome — and perhaps, one 
day, even resurrecting the woolly mammoth, or 
at least giving an Asian elephant enough mam- 
moth genes to survive in the Arctic. A reserve 
in north Siberia, dubbed Pleistocene Park, has 
even been proposed as a home for such a popu- 
lation of cold-tolerant elephants. 

Woolly mammoths (artist’s impression) evolved from a shared elephant ancestor to survive the cold.

It is not clear whether this would ever be
feasible, however. Such a venture could allow
endangered Asian elephant populations to
thrive, but innumerable hurdles stand in the
way of breeding genetically modified ‘woolly
elephants’. Shapiro outlines these in her
book How to Clone a Mammoth (Princeton
University Press, 2015; see review on page 30),
charting the ethics of using reproductive technologies
for an endangered species and the fact
that the field of elephant reproductive biology
is still immature. “I probably should have called
the book How One Might Go About Cloning
a Mammoth (Should It Become Technically
Possible, And If It Were, In Fact, a Good Idea,
Which It’s Probably Not),” Shapiro says. “But
that was a much less compelling title.”
Five researchers reveal the
challenges of retiring to make 
way for the next generation. 


BY MEGAN SCUDELLARI 


Hans-Hilger Ropers vividly recalls the moment when he realized that
he was running out of time to fulfil his scientific ambitions. It was at
a lecture series in 2002, and the first three talks, delivered by distinguished
colleagues from Max Planck Institutes (MPIs) across Germany, each
began with a variation of, “Ladies and gentlemen, what I’m going to talk about
today has kept me busy for 20 years.”

Ropers did not have 20 years. He had become director of the MPI Department
of Human Molecular Genetics in Berlin when he was 51. He was fast
approaching 60, and under German law he would be required to retire in
5 years. Ropers knew that he would not have enough time to complete his work
on the genetic underpinnings of early-onset cognitive disorders. He also knew
that when he retired, the institute would dissolve his 70-member department,
so his employees would be out of jobs.

Ropers is hardly the only person to worry about retirement, which has become
a charged issue in the sciences. The proportion of US National Institutes of Health
(NIH) grants awarded to people over the age of 65 more than doubled between
1998 and 2014, going from 4.8% of the grant budget to 12%. Out of concern that
this greying scientific workforce is limiting the availability of grants and jobs for
young scientists, the NIH this year proposed a solution: an ‘emeritus’ award to
encourage senior scientists to wind down their research and hand projects over
to junior faculty members. It was met with overwhelming disapproval.
Scientists argued that the proposal placed too much focus on
age, and risked awarding grants for factors other than the merits of
the research. “While I like the idea of transferring knowledge and
resources, if you can’t do so efficiently inside a typical 40-year career
span, then why should you be given longer?” says Paul Brookes, 42, a
biologist at the University of Rochester Medical Center in New York.
“Nobody can argue that emeritus professors were somehow caught
unaware that they’d have to retire some day.”

To explore the rules, culture and expectations of retirement, 
Nature contacted academics in the middle of making their exit. The 
issues faced by older scientists can vary dramatically by country, 
and there is no right way to retire, says Tania Saba, associate dean of 
graduate studies at the University of Montreal in Canada, who has 
studied retirement trends since 1993. “Some want to keep working;
others would retire the next morning if they could.”


Fighting the clock 

After Ropers had swallowed his panic, he crafted a plan. He rushed to 
obtain a 3-year extension so he could work until he was 68 (a provision 
now available to most academics in Germany), and later a second, 
unusual extension to work to 71. But even so, uncertainty about his 
future affected his department. “All your co-workers realize that this
guy is turning 65 in a couple years, so they start to look for alternatives.”
Over his final 6 years, Ropers watched his department shrink from
70 people to 25, then to 10. If he had known from age 60 that he would
be able to work for 11 more years, Ropers would have sought appropriate
grants, and would have been able to retain faculty members and
plan his retirement better. “It would have made a world of difference,”
he says.

Mandatory retirement for public employees used to be the norm 
across Europe, and despite some changes, it maintains a foothold in 
countries including Germany, Sweden and Spain. Legal challenges 
calling the practice discriminatory have mostly been unsuccessful. 
In 2007, the top European court ruled that European Union (EU) 
countries can force workers to retire as a way to free up job places and 
reduce unemployment for younger workers. But there is no empirical 
evidence that such an approach works. In fact, studies suggest that 
higher employment for older people is often correlated with higher 
employment and higher wages for younger workers. 

Some people facing mandatory retirement choose to leave the
country. That was the reason immunologist Klaus Rajewsky, 78,
left the University of Cologne in Germany in 2001 to set up a lab in
the United States. “It’s not so easy, particularly for experimental
scientists to start a new lab at that stage,” he says. “So most people
retire, or try to get some prolongation where they are.”

NATURE.COM: For a podcast on retirement, see go.nature.com/d2y5p2

Ropers’ department officially closed in November 2014. His last
EU grant will expire shortly, and the remaining lab space is now
being refurbished for the director of a new department. “After that,
I will still have a chair and a room,” says Ropers. “That will be it,
though.” He still plans to publish what he calls the biggest paper of
his career, with four years’ worth of data on the genetics of mental
disabilities. He has also accepted a part-time, paid job as a clinical
geneticist at the Institute of Human Genetics at the University of
Mainz. Although still concerned about the Max Planck Society’s
decision to close its only department for human genetics, he says
that it is time to move on to other things, including an around-the-world
trip with his wife. “The sun is shining,” Ropers says. “Life
could be worse.”


Changing course 

Japanese stem-cell biologist Norio Nakatsuji turned 65 this year, and 
just like in Germany, that means mandatory retirement. As his final 
grant runs out over the next 2 years, his 2 labs at Kyoto University 
will close and most of his 16 laboratory members must seek employ- 
ment elsewhere. 

Although he is leaving academia, Nakatsuji has big plans in busi- 
ness. As former director of the Institute for Frontier Medical Sciences 
at Kyoto University, he has spent a career deriving and distributing 
stem cells. During that time, he co-founded a biotechnology com- 
pany, ReproCELL in Yokohama, which made a splashy initial pub- 
lic offering in Japan in 2013. Using US$2 million of his earnings, 
Nakatsuji founded two stem-cell-related companies: a consulting 
firm that bridges academia and industry, and a biotech company that 
is developing drug-testing devices based on heart muscle derived 
from stem cells. “They were unexpected funds, so I can put my
money towards being an entrepreneur,” says Nakatsuji. “That would
be impossible for an ordinary professor.”

His move into business is unusual in Japan. It is much more com- 
mon for a retired academic to take a position at a private university. 
But this did not appeal to Nakatsuji. Such jobs, he says, “should be
made open to young people”.


Japanese government policies have increased the number of
PhD graduates by an estimated 6.2% per year since the 1990s. But
the number of academic positions available to those graduates has 
remained stagnant. The pipeline narrows as soon as they graduate: 
in 2014 the Japan Society for the Promo- 
tion of Science, the country’s largest sci- 
entific funding agency, granted only 362 
out of 3,222 applications for postdoctoral 
fellowships across all fields. Most PhD 
graduates who do not get fellowships take 
on short-term jobs at universities or public 
research institutions. 

Making space for young scientists 
should be a priority in Japan, but it is a great 
challenge, says Osamu Terasaki, a physicist 
who worked in academia there for 36 years 
before taking on a professorship at Stockholm University, a position 
he retired from in 2010, at age 67. “In Japan, senior investigators keep 
so much power, and for a long time,” says Terasaki. “Young people 
should take over. It would make the situation more healthy.” 


Keeping busy 
The top task on Uta Frith’s post-retirement to-do list is to publish 
a graphic novel. 

Frith, a developmental psychologist, retired from University 
College London in 2006, at the mandated age of 65. Although she 
could have fought to get another grant and keep working, she opted 
instead to leave the lab. “I was so happy that I wouldn’t have to apply
for any grants ever again.” Frith gave up hands-on research but has
taken advantage of the time to write up some of her previous work: she
has published 33 papers since 2007. Now 73, she calls her retirement
the “best years of my life”. They are also some of the busiest. She has
founded two networks for women in science and technology, called
Science&Shopping and UCL Women; she collaborated with the BBC
on a television documentary about autism and is working on another
about obsessive-compulsive disorder; she chairs the diversity programme
of the Royal Society; and she tweets regularly, to more than
15,000 followers. Frith also travels with her husband, psychologist
Chris Frith, to Aarhus University in Denmark for a month once a year,
to advise students and teach. “I only do things I really feel passionate
about. That's the beautiful luxury of retirement,” she says.

Frith’s current passion is her graphic novel. She and her husband 
won the French National Centre for Scientific Research’s 2014 Jean 
Nicod Prize for cognitive science and philosophy, and promptly spent 
the €25,000 (US$28,000) award on paying an editor and artists to help 
them develop the novel, which is about social cognition: the study of 
how the human brain processes and uses information about other 
people. The Friths hope that their book — which they will release in 
instalments starting as early as this year — will help them to share their 
research with the public. 

“We could never do this if we had to pursue paid work,” says Frith.
“Retirement is a time you can really have complete liberation from
responsibilities and duties, and devote time to things you might
otherwise never have done.”

Although mandatory retirement was phased out in the United 
Kingdom in 2011, its effects can still be felt, says Peter Lawrence, 
73, a developmental biologist at the University of Cambridge and 
an outspoken opponent of age discrimination. “Many regard people
above retirement age as odd to still be working. You feel as if you're
not entitled to work.”

A 2010 study from the UK Department for Work and Pensions 
found that, despite the lack of evidence, employers still believe that 
productivity declines with old age. Many still hew to the traditional
retirement age; the average age of retirement in the United Kingdom 
today is 64.7 for men and 63.1 for women, although it has been 
increasing. 


Following passions 
In the United States, working past traditional retirement age seems 
to be becoming the rule rather than the exception. 

Mandatory retirement was phased out of 
US law starting in the late 1970s, and was 
ultimately abolished in 1986, although aca- 
demic institutions had until 1994 to com- 
ply. This has resulted in ample opportunity 
to observe what happens in the absence of a
forceful shove out of the door: people work 
for longer (see ‘Retired yet?’). 

According to the National Science Foun- 
dation, the proportion of working scientists 
and engineers in the United States over the 
age of 50 has increased from 1 in 5 in 1993 
to 1 in 3 in 2010 (ref. 2). “As long as somebody is contributing sub- 
stantially — teaching, administratively, research-wise — why force 
them to retire?” asks John Dowling, a neuroscientist at Harvard 
University in Cambridge, Massachusetts, who will formally retire 
this June, at the age of 79. Although financial security may play a 
part, some certainly stay on for the love of science: 54% of employed 
people aged 65 or over say that they are working because they want 
to, not because they have to, according to the Pew Research Center.

But what if they no longer want to? Physicist Mark Adams voluntar- 
ily retired from the University of Illinois at Chicago at the age of 59, 
to take advantage of state pension rules. He feared that changes to the 
poorly funded system, which had been attacked by politicians, would 
leave him worse off. If it had not been for that, he would probably have
worked for another four years, he says.

Retired yet? The US National Institutes of Health has witnessed a decline in the
proportion of R01 grants going to younger researchers, and a rise in
those going to researchers aged 66 or over.

Adams also knew that his work would not languish when he retired. 
“I have thousands of colleagues,” he says, including international
collaborators working on results from the now-decommissioned
Tevatron proton-antiproton collider at Fermilab in Batavia, Illinois,
and the Large Hadron Collider (LHC) at CERN, Europe's particle-physics
lab near Geneva, Switzerland. The field, he says, “has almost a
corporate structure to keep experiments going”.

Two years after his official retirement, Adams still uses his office at 
the university, where he is completing the analysis of data from one 
of the particle detectors at the LHC. But he no longer receives a pay 
cheque, and he recently transferred his last student to a colleague. 
He plans to ease out entirely soon, to devote himself full-time to 
QuarkNet, an educational programme that he joined 14 years ago 
to bring high-energy physics experiments to schools in the Chicago 
area. “I feel like that’s a real volunteer thing I can do with my skill
set, something that makes a difference. That’s where I want to put
my time.”


Letting go 

Hélène Delisle also wanted to accomplish things outside academia.
She had studied international nutrition at the University of Montreal 
for 29 years. But in January 2014, when she turned 70, she informed 
her department director that she would be retiring towards the end 
of the year. 

Like the United States, Canada no longer enforces retirement: 
mandatory retirement was abolished province by province between 
1973 and 2009. So Delisle was free to work for as long as she desired, 
and eventually that desire waned. Her husband had fully retired 
from medical practice by 2013, and it became apparent that they 
would have a better lifestyle together if they were both retired, pur- 
suing new endeavours together rather than trying to plan around 
her career. 

Delisle remained fully active right up to her official retirement in 
September, mainly finishing reports for a six-year nutrition project 
in West Africa. Since then, she has been transferring the leadership of 
a World Health Organization Collaborating Centre that she headed 
to a colleague, has served on several scientific committees, and has 
joined the editorial board of a nutrition journal. She visits her office
at the university weekly to see students who are wrapping up their 
graduate work, and those regular visits have made the transition 
smoother. “In a way, retirement is a separation, a severance, because
you used to go to work every day and have plenty of activity and
hectic business travel,” she says. “You've got to let that go, and that
can be difficult.”

Most countries have an official retirement age of 60–65, and some
enforce mandatory retirement for public workers. But the average age
at which workers effectively leave the workforce varies dramatically.

Most of all, Delisle is finally making space for other pursuits, foremost
among them social activism, and spending more time with
her husband and her piano. Delisle has played in a chamber-music 
trio for years, and looks forward to devoting more time to concerts. 

Although there are few universal lessons to be drawn about how 
to wind down one’s career, most researchers abhor the mandatory 
system. Delisle and Adams say that it was important to make their 
own decisions about retirement on the basis of their desire and pro- 
ductivity, and not have the decisions made for them because of their 
age. “The ideal would be that no one checks how old you are but just
looks at what you're doing and what you are able to do,” says Ropers.
“But that idea hasn’t pervaded into public routine.”

In fact, pushing senior researchers out may be doing more harm 
than good. “There is no evidence that shows that early retirement can
reduce unemployment, particularly youth employment,” says Saba.
Most studies show the opposite: an analysis from the nonprofit US 
National Bureau of Economic Research in Cambridge, Massachusetts, 
for example, found that paying senior individuals to retire actually 
increases the unemployment rate for younger workers, because older 
people remain productive and spur the creation of new jobs. Research
suggests that there is not a fixed number of jobs in the economy, so the 
idea of one-to-one replacement is false. For example, women did not 
replace men in the workforce in the late twentieth century; on the con- 
trary, two-earner families increased disposable income and prompted 
job creation. But how this dynamic might differ in academia, which 
is dominated by tenure and the pressures of limited funding, has not 
been fully analysed. 

Still, most senior scientists interviewed for this article emphasized 
concern for those in the next generation and felt they should be making
room for them. “It’s important to give young people the opportunity
to do research,” says Dowling.

But no matter how or why you go about it, do not think of retire- 
ment as an end, says Frith. “Stopping doesn’t mean stopping. It 
means do what you've always wanted to do.”


Megan Scudellari is a science journalist based in Boston, 
Massachusetts. 
EQUALITY


After years of second- 
class status in research 
partnerships, African 
scientists are calling 
for change. 


BY LINDA NORDLING 
The gavel fell at precisely midday on
18 July 2014. Ruling on a lawsuit lodged
against the Kenya Medical Research
Institute (KEMRI), the Nairobi Industrial
Court agreed that six Kenyan doctors in an
international research partnership had been
systematically passed over for promotion and
training, whereas their European colleagues
had flourished.

It was a landmark case — and not just 
because the judge ordered KEMRI to pay 
each of the doctors 5 million Kenyan shillings 
(US$56,000) to compensate for their stymied 
careers. It was also perhaps the first time that 
African researchers had so strongly — and 
so publicly — voiced resentment of their 
perceived second-class status in partnerships 
with foreign colleagues. 

Collaborations have proliferated in recent 
decades as international agencies have stepped 
up funding for health research in Africa. Yet
African scientists say that they often feel
stuck in positions such as data-collectors and 
laboratory technicians, with no realistic path 
to develop into leaders. 

“I think it’s a big problem,” says Marcel
Tanner, director of the Swiss Tropical and 
Public Health Institute in Basel. And when 
partnerships fail, he says, it is often the people 
from developing countries who have the most 
to lose. “They don’t have the support to see them 
through changes of funder or projects.” This 
happened, for example, in 2010, when French 
backers had to withdraw from a malaria-control 
project in Côte d’Ivoire because of civil war. As
a result, African staff members were left with a
half-built laboratory, no way to pay the people 
they had employed and no funds to continue 
the project. 

To try to give African scientists more 
independence, several global funders are testing 
ways to build research leadership in Africa, and
are transferring some ownership of their own
projects there to local scientists. In the works
for later this year is a ‘fairness index’, in which
institutions and funders will earn certificates for 
engaging in equitable partnerships. 

But there is still a long way to go, says Glenda 
Gray, president of the South African Medical 
Research Council (SAMRC) in Cape Town. No 
matter how hard African scientists struggle for 
control of the research agenda, she says, they 
will not fully succeed until their own govern- 
ments start to pay their share. International 
funders do sometimes still act as “neighbour- 
hood bullies” in Africa, she says — “but it’s easy 
to be strong when you have money”. 


SWIM AGAINST THE TIDE 
The roots of these unequal partnerships lie in 
how modern research began in Africa — gener- 
ally with European nations that opened research 
stations in their colonies to study economically 
important issues, such as tropical diseases, crops 
or agricultural practices. As the centres grew, 
Africans were recruited first as assistants and 
then as scientists. But even after countries such 
as Tanzania and Kenya became independent in 
the 1960s, the most influential research centres 
there were often funded, led or at least heavily 
influenced by partners in wealthier countries. 

By the mid-1990s, however, researchers 
both in Africa and in the developed world were 
voicing concerns about ‘helicopter science’ 
or ‘sample safaris’, in which foreign scientists
were coming to Africa and collecting samples 
— often with local scientists’ help — before 
going home to analyse them and write up their 
findings, giving little or no credit to their Afri- 
can counterparts. In 2001, for example, Kenyan 
researcher Moses Otsyula claimed that scientists 
from the University of Oxford, UK, stole blood 
samples that he had collected from unusually 
HIV-resistant children at a Nairobi orphanage. 
The Oxford scientists called it an inadvertent 
error of miscommunication, saying that they 
thought they had permission at the time. 

These misunderstandings are becoming rare 
as foreign scientists grow more sensitized to 
ethical issues, and as African scientists realize 
the need for clearer rules governing sample col- 
lection and export. Yet inequitable partnerships 
remain a problem. In a survey published last 
year, the European Association of Development 
Research and Training Institutes in Bonn, 
Germany, found that the role of partners 
in Africa and other developing regions was 
“often still primarily limited to collecting 
data”, whereas partners from the developed
world played a leading role in analysing and 
publishing the findings (see go.nature.com/42xjns).
Some of the people interviewed
for the study said that, even for developed-world 
scientists who want to build more-equitable 
partnerships, the mounting pressure to publish 
articles to advance their careers often leaves 
them no time to do so. 

Many African scientists have had personal
experience with problematic partnerships.
Some describe being shut out of collaborations 
that they helped to set up, whereas others talk 
about running projects on the ground as 
their developed-nation colleagues fly in and 
out — often combining a little work with a 
lot of tourism. But few are prepared to make 
their stories public because that might mark 
them as troublemakers and cut them off from 
future funding. 

“There are good and bad relationships, and 
all of us have had them,” says Gray. Although 
her council spends roughly US$9 million a 
year on health research — one of the biggest 
such budgets on the continent — that figure is 
only about one-fifth of what the US National 
Institutes of Health spends in South Africa. 

Still, she says, overseas funders are increasingly
willing for African agencies such as hers
to influence the agenda. The SAMRC has made 
steps towards that. In 2013, for example, it began 
to combine its funding with that of international 
donors to create joint-funding pots in which 
both groups have a say in what they support. 

There has definitely been progress in recent 
years, agrees Kevin Marsh, who until last year 
headed a partnership between KEMRI and the 
UK Wellcome Trust, known as KWTRP. Dur- 
ing his tenure, he battled fiercely with funders 
to improve the conditions for Africans who are 
working for the collaborative programme. At 
one point, Marsh says, he nearly resigned when 
the funders did not want to provide equal 
salaries to UK and African staff. 


A LONG ROAD
But even though there are many more 
prominent African researchers today than there 
were 20 years ago, he says, it is important not to 
overstate the gains. That is why he is helping to 
set up the Alliance for Accelerating Excellence 
in Science in Africa (AESA), a research-man- 
agement hub in Nairobi, that African presidents 
will launch in June. Designed to manage outside 
funding for programmes in Africa, AESA has 
start-up support from several major organiza- 
tions, ranging from the KWTRP to the policy- 
making body New Partnership for Africa’s 
Development, headquartered in Pretoria. 
Another project is the COHRED Fairness 
Index, which takes its name from its lead- 
ing sponsor, the Council on Health Research 
for Development (COHRED) in Geneva, 
Switzerland. The index will permit institutions, 
funders or other groups involved in research to 
receive certification if they engage in good partnerships.
Organizations can use this certification
as a label to signify their status, in the same
way that manufacturers put symbols on their
goods to mark them as eco-friendly. A funder 
might get high marks if its programmes target a
developing country’s national research priorities
or its burden of disease, says Najia Musolino,
a senior specialist with COHRED who is 
working on the index. Conversely, a health or 
science ministry in a developing country might 
get a bad rating if it does not regularly set or 
update national research priorities. 

The index is expected to enter a trial period 
towards the end of this year and, if successful, 
could be extended to research areas other 
than health. Musolino says that funders and 
academic institutions in Africa and abroad 
have offered cautious expressions of support, 
and think that the index could help African 
researchers to choose good partners. 

Gray says that the index will succeed only if 
developing-country researchers get a voice in 
how it works — and if African countries start 
to put more money into health research, so 
that their scientists have a stronger base from 
which to negotiate partnerships. That argu- 
ment resonates with other African scientists, 
who want nothing more than to be accepted as 
equals. “Nobody wants to be a token scientist,”
says Kenyan immunologist Faith Osier, who 
works with the KWTRP. “It’s true that African 
scientists face disadvantages,” she says. But at 
the end of the day, she wants her contributions 
to be judged on merit — not on anything else. 

Some African scientists think that it is wrong 
to place all the blame for inequalities in Africa's 
research landscape on international funders. “It 
is pointless to say to the United States that ‘you 
should fund our scientists, but we should tell
you what you should do’. That, to me, is hypocritical
and disrespectful, of not just the funders,
but also of their own local scientists,” says Salim
Abdool Karim, director of the Centre for the 
AIDS Programme of Research in South Africa 
in Durban. “What we need is for African sci- 
entists to understand, appreciate and promote 
excellence as a mechanism by which they can 
deal with international researchers on a collegial 
basis, not on a neocolonial basis.” 

Many of the affected researchers agree with 
Gray that a key missing factor is national 
government commitments to science. Despite 
promises of increased funding by many 
African governments over the past decade — 
in Kenya, Nigeria, Tanzania and South Africa 
among others — few have fulfilled their 
stated ambitions. “We should look critically 
at ourselves first and foremost before we start 
blaming funders or European universities,” 
says Kelly Chibale, a Zambian biochemist who 
heads a drug-discovery unit at the University 
of Cape Town, South Africa. “Unless — and 
until — we create good local infrastructure 
and a supportive local environment, we will 
continue to struggle.”


Linda Nordling is a freelance journalist based 
in Cape Town, South Africa. 
Climate advisers must
maintain integrity 


As global negotiations fail on emissions reductions, scientific advisers 
need to resist pressure to fit the facts to the failure, warns Oliver Geden. 


Disenchantment has set in well ahead
of the 21st Conference of the Parties
(COP21) to the United Nations
Framework Convention on Climate Change 
(UNFCCC) in Paris in December. Scientists, 
policy-makers and the public already accept 
that progress will not be enough to keep 
global average temperature rise within the 
2°C limit set at the 2010 UN climate summit. 


The negotiations’ goal has become what is 
politically possible, not what is environmen- 
tally desirable. Gone is a focus on establishing
a global, ‘top down’ target for stabilizing
emissions or a carbon budget that is legally 
binding. The Paris meeting will focus on 
voluntary, ‘bottom up’ commitments by
individual states to reduce emissions. 

The global climate target is being watered 
down in the hope of getting any agreement
in Paris. The 2°C warming limit need only
be kept ‘within reach’. The possibility of
using ‘ratcheting mechanisms’ keeps hopes 
alive of more-ambitious policies, but such 
systems are unlikely to achieve the desired 
outcomes. Strict measuring, reporting and
verification mechanisms are yet to be agreed. 
There is another casualty: scientific 
advice. Climate scientists and economists
who counsel policy-makers are being pres- 
sured to extend their models and options for 
delivering mitigation later. This has intro- 
duced dubious concepts, such as repaying 
‘carbon debt’ through ‘negative emissions’ to 
offset delayed mitigation — in theory. 

Scientific advisers must resist pressures 
that undermine the integrity of climate 
science. Instead of spreading false opti- 
mism, they must stand firm and defend 
their intellectual independence, findings 
and recommendations — no matter how 
politically unpalatable.


POLITICAL WEATHER 

Climate researchers who advise policy- 
makers feel that they have two options: be 
pragmatic or be ignored. They either dis- 
tance themselves from the policy process by 
declaring that it is no longer possible to stay 
within a 2 °C-compatible carbon budget, or 
they suggest practical ways to dodge carbon-
budget constraints.

Many advisers are choosing pragmatism. 
This can lead to paradoxical positions, as 
exemplified by shifting assumptions in 
climate economics over the past few years. 

Each year, mitigation scenarios that 
explore policy options for transforming 
the global economy are more optimistic —
and less plausible. Advisers once assumed 
that the global emissions peak would have 
to be reached before 2020 and that annual 
emissions-reduction rates of more than 3% 
were not feasible. Those assumptions keep 
changing. 

For example, the fourth assessment of the 
Intergovernmental Panel on Climate Change 
(IPCC), published in 2007, stated that emis- 
sions must peak by 2015 to stay within 2°C of 
warming; yet the fifth IPCC report, released 
last year, refers to 2030 emissions levels higher 
than today’s that are still compatible with this 
limit, albeit with annual emissions-reduction 
rates of 6%. The annual Emissions Gap 
Report by the United Nations Environment 
Programme had an original deadline of 2020 
for its analysis of how to fill the gap between 
global emissions levels compatible with a 2°C 
target and national pledges; the 2014 edition 
extended it to 2030. 

In both cases, climate economists got 
around past ‘make-or-break’ points for the 
2°C target by adding ‘negative emissions’ — 
the removal of greenhouse gases from the 
atmosphere during the second half of this 
century. Most models assume that this can be 
achieved using a combination of approaches 
known as BECCS: bioenergy (which would 
require 500 million hectares of land — 
1.5 times the size of India) and carbon capture
and storage, an unproven technology.

The ‘carbon debt’ to be paid back later 
can be substantial — sometimes doubling 
the remaining emissions quota originally 
set by scientists to stay within the 2°C limit.

Meanwhile, policy-makers are delighted 
to hear that despite 20 years of mounting 
emissions, the 2 °C target is still theoreti- 
cally within reach. They ignore the fine print 
of the IPCC reports, to climate scientists’ 
increasing dismay. 

That said, there are positive signs of 
researchers holding their ground. For 
example, the Working Group III contribution
to the fifth IPCC report, which
assesses mitigation options, rejected
the task assigned to it by the UNFCCC:
that of evaluating the adequacy
of a 2°C target compared to one
of 1.5°C. The IPCC turned the question
straight back to policy-makers, saying that 
target-setting is too dependent on ethical 
and value judgements.

This principled stance could pave the way 
for a weaker climate objective. By saying that 
science is incapable of judging levels of risk 
avoidance or intergenerational fairness, the 
IPCC might dilute its influence in the global 
target-setting process. But if scientific advis- 
ers were to refrain from demanding exact 
stabilization targets, the UNFCCC would 
no longer be able to justify a global political 
agreement with a simple ‘science says so’. 


REALITY CHECK 

Climate-science advisers should use the time 
before Paris to reassess their role. Do they 
want to inform policy-makers or support the 
political process? The climate policy mantra 
— that time is running out for 2°C but we 
can still make it if we act now — is a scientific 
nonsense. Advisers who shy away from say- 
ing so squander their scientific reputations 
and public trust in climate research.

The scientific community must defend its 
independence from outside interference — 
from government administrations and non- 
governmental organizations attempting to 
woo scientists to back their ‘just’ causes, and 
from climate-change deniers. Inviting non- 
scientific stakeholders from business, govern- 
ment and civil society to influence research 
agendas, as happens in European sustainability
research, must be reconsidered to curtail
the politicization of science. 

At the same time, climate scientists should 
not overestimate their influence. They need 
a realistic picture of their target audience 
and to better understand how their exper- 
tise is used. In governments and parlia- 
ments, consistency of talk, decisions and 
action is more the exception than the rule. 
The value of expert knowledge depends 
mainly on its utility. Dealing with prob- 
lems is more important than solving them; 
intentions are more important than results.

Everyday politics is therefore dominated 
not by evidence-based policy-making but by 
attempts at ‘policy-based evidence-making’.
For instance, policy-makers view the IPCC 
reports mainly as a source of quotes with which 
to legitimize their preferences, a practice that 
the IPCC should on occasion discourage. 

Climate-policy advisers should align their 
expectations with those in other public-policy 
domains such as development, health, foreign 
and security policy: scientific evidence is just
one of many factors affecting political deci- 
sions, such as voter preferences, available 
funds, competing interests and sheer prag- 
matics. The best that scientists can hope for 
is ‘evidence-informed policy-making’. They
are not, after all, democratically elected. 

Scientific advisers should resist the temp- 
tation to be political entrepreneurs, peddling 
their advice by exaggerating how easy it is to 
transform the economy or deploy renewable 
technologies, for instance. Their task is to ana- 
lyse critically the risks and benefits of political 
efforts and contribute empirically sound — 
and sometimes unwelcome — perspectives to 
the global climate-policy discourse.


Oliver Geden is head of the EU Research 
Division at the German Institute for 
International and Security Affairs in 
Berlin, Germany. He previously worked in 
the policy-planning units of two German 
government ministries. 

e-mail: oliver.geden@swp-berlin.org 
CORRECTION

The Comment ‘Five priorities for the UN 
Sustainable Development Goals’ (Y. Lu 
et al. Nature 520, 432–433; 2015) gave
an incorrect affiliation for co-author 
Nebojsa Nakicenovic. He is deputy 
director-general of the International 
Institute for Applied Systems Analysis in 
Vienna, Austria. 


Bees in Ibn Butlan’s eleventh-century Maintenance of Health, published in Italy as the Tacuinum Sanitatis. 


APICULTURE 


Telling the bees 


John Burnside reflects on the role of art and myth
in the health of the hive.


Social orderliness, productivity and,
most of all, industry lie at the heart
of bee mythology. Hence, for example,
this image in philosopher Bernard de
Mandeville’s The Fable of the Bees (1714):


A Spacious Hive well stockt with Bees,
That liv’d in Luxury and Ease;
And yet as fam’d for Laws and Arms,
As yielding large and early Swarms;
Was counted the great Nursery
Of Sciences and Industry.


As the Industrial Revolution unfolded, 
this anthropomorphic and mechanistic view 
of bees as little machines of honey produc- 
tion came to obscure their importance as the 
means through which more than 30% of our 
food crops and 90% of the world’s wild flora 
survive. “Every third bite of food you take, 
you can thank a bee or other pollinator for,” 
as biologist E. O. Wilson, doyen of the social 
insect, notes in his introduction to Stephen 
Buchmann and Gary Nabhan’s The Forgotten 
Pollinators (Island, 2012). 

For almost as long as humans have culti- 
vated the land, we have kept honeybees (Apis 
mellifera). The ancient Egyptians used honey
in their rites for the dead; the Bible, the Hindu
Vedas and the Koran all make reference to the 
healing qualities of honey. Today we are faced 
with widespread colony collapse disorder, in 
which commercial hives die off within weeks. 
According to one survey, “between 0.75 and 
1.00 million honey bee colonies are estimated 
to have died” in the United States over the 
winter of 2007–08 alone (D. vanEngelsdorp et
al. PLoS ONE 3, e4071; 2008); that is, 30–40%
of the total. Suggested culprits range from 
climate change to the varroa mite — which 
transmits deformed wing virus, among other 
conditions — and pesticides such as neo- 
nicotinoids (see Nature 520, 416; 2015). 

The causes are not yet clear, but in my view 
the myth of the indefatigable honeybee has 
played a part. What is happening serves to 
highlight how dangerous our stories about 
other animals can be, if they warp our under- 
standing of the animals’ real nature and needs. 

In La Vie des Abeilles (The Life of Bees, 
1901), Belgian playwright and nature writer 
Maurice Maeterlinck
tells how apiculture
saw a revolution in
capacity and production
with Venetian
officer Francesco de Hruschka’s invention
of the centrifugal honey extractor in 1865. 
Humans, Maeterlinck notes, became “furtive” 
masters of the insects: “For the destiny once 
imposed by the seasons he has substituted his 
will ... He will five or six times in succession 
deprive the bees of the fruit of their labour.” 

Even Maeterlinck, with his inflated regard 
for the human will, did not foresee the man- 
agement regimes that are now common in 
industrialized honey production. Bees are 
consistently deprived of the fruit of their 
labour, routinely being given corn syrup in 
place of their honey. Yet research indicates 
that a key ingredient in the bee's natural diet 
is p-coumaric acid, a compound that triggers 
the expression of detoxification genes and 
is present in honey, but not in corn syrup 
(W. Mao et al. Proc. Natl Acad. Sci. USA 110, 
8842-8846; 2013). Another crucial factor 
might be propolis, a sticky substance that 
bees gather from sap, and a known modu- 
lator of their immune systems. The hive is 
not only a residence and honey factory; it 
is a sophisticated regulatory system for its 
busy denizens. 

Alert to the plight of pollinators, writers 
and artists have begun retelling the bee's story. 
Composer Kelvin Thomson and entomolo- 
gist Mark Brown, for instance, premiered 
their one-act opera The Silence of the Bees in 
2013. Some poets have taken their cue from 
Henry David Thoreau’s observation that the 
keeping of bees is “like directing the sun- 
beams” — good practice should be a matter 
of “very slight interference”. 

We need these new stories: as long as the 
old myth of inexhaustible productivity domi- 
nates popular thinking and commercial prac- 
tice, we risk continuing destructive regimes. 
One of the ways in which poets, for example, 
have restored the idea of slight interference 
has been to recall beekeeping practices that 
once helped to maintain the hive’s well-being.

One was the ancient folk tradition of ‘tell- 
ing the bees’ about important events, includ- 
ing when their keeper died. This observance 
elegantly reinforced the sense that bees were 
not just a resource, but integral to human 
communities. US poet John Greenleaf 
Whittier was already writing about such 
rituals in 1858; the speaker of Jo Shapcott’s 
‘I tell the bees’ (2010) enacts a new version of
the rite, as an abandoned woman takes over 
the hives left behind by her lover: 


I went outside when the sun rose, whistling
to call them out as I walked towards the hive.
I pressed my cheek against the wood, opened
my synapses to bee hum, I could smell bee hum.
‘It’s over, honies,’ I whispered, ‘and now you’re mine.’


Sean Borodale’s lyric chronicle of his 
first year as a beekeeper, Bee Journal
(Jonathan Cape, 2012) goes for
close day-to-day observation, in poems 
“written at the hive wearing a veil and 
gloves” that express a wonderfully 
detailed and subtle appreciation of 
honeybee life: 


How bees touch and re-align their touch. 
Light migration; 
noise of a body in continual repair 


This is one vital function of art in our 
lives: it restores our sense of wonder, 
and so increases our respect for other 
life forms. Yet writers and artists can 
also actively contribute new knowledge. 
One example of this is the work of visual 
artist Amy Shelton, whose engagement 
with bees and beekeeping over a number 
of years led her to set up the Honeyscribe 
project in 2011. 

In ancient Egypt, ‘honeyscribes’
recorded the productivity of the hives. 
Shelton goes further, charting in multi- 
media artworks threats to honeybee 
health and reflections on the species’ 
behaviour. Her project, she writes, 
“emphasizes communication, diver- 
sity and collaboration” in our shared 
environment. “The beehive reflects 
the flora, the temperature and the pes- 
ticides present in the environment 
within which it is situated, amalgamat- 
ing these things into one vastly com- 
plicated self-regulating organism” (see 
www.amyshelton.co.uk/art_works). 

I was fortunate to collaborate with 
Shelton on the artist’s book Melis- 
sographia, which sets a series of poems 
drawing on Maeterlinck’s study along- 
side embossed, hand-painted pollen 
maps. These, as Shelton puts it, reference 
“a selection of seasonal pollen loads col- 
lected by the honeybee from single plant 
species” and “tiny botanical samples 
of flowers collected over the apiarist’s 
calendar year, which are important to 
sustaining the health of the honeybee”. 

Besides making her own extraordinary, 
often moving pieces, Shelton aims to fos- 
ter a dialogue on bees between apiarists, 
scientists, writers and artists, children 
and the public by means of art, work- 
shops and events. Hopefully, through 
this and other projects, we will continue 
to create less-mechanistic stories about 
A. mellifera. In so doing, we will evolve 
better practices in our dealings with a 
creature that is essential to our agriculture
and our culture. See News & Views, p.38.


John Burnside is a poet and novelist, 
nature writer and former computer systems 
analyst. His most recent poetry collections 
are Black Cat Bone and All One Breath. 
e-mail: jb44@st-andrews.ac.uk 
The 40,000-year-old ‘Lyuba’ is one of the best-preserved woolly mammoths ever found.


A behemoth revived 


Henry Nicholls examines a clear appraisal of what it 
would really take to resurrect extinct species. 


A hazard of studying an extinct
species such as the
woolly mammoth is that you spend
a lot of time answering the same question:
“Is it possible to clone it?” For evolutionary
molecular biologist Beth Shapiro, who has a 
long-standing interest in both ancient DNA and 
mammoths, the solution was to write a book. 
In How to Clone a Mammoth, Shapiro 
has an alluringly simple goal: “to provide a 
road-map for de-extinction” in a single vol- 
ume “that separates science from science 
fiction”. She begins by considering which 
species might be targeted for de-extinction, 
anticipating many of the difficulties to come. 
The poor preservation of dodo remains, for 
instance, means that any DNA fragments 
recovered are unlikely to give a clear idea of its 
genome. The moa, a giant flightless bird from 
New Zealand, is probably out on the basis 
that its closest living relative (the tinamou) is 
not close enough to assist in assembling its 
DNA. Steller’s sea cow (Hydrodamalis gigas) 
has no living surrogate that could accommo- 
date a cloned fetus, and the natural habitat of 
the Yangtze River dolphin (Lipotes vexillifer;
widely thought to be extinct) is currently too 
polluted to receive the cetaceans. 
Shapiro contends that the focus on indi- 
vidual species is misguided. “De-extinction 
has a place in our
scientific future, but 
not as an antidote to 
extinctions that have 
already occurred,” 
she writes. Engineer- 
ing extinct traits into 
living organisms, how- 
ever, could help those 
organisms to adapt to 
environmental shifts, 
and could re-establish 
ecological interactions 
that disappear when a 
species goes extinct. In 
Shapiro's view, this is 
the real value of de-extinction technology. Yet 
the little that we currently understand about 
the operation of bygone genes means that her 
argument is necessarily vague on detail. 
Targeting the woolly mammoth (Mam- 
muthus primigenius), a keystone species of the 
steppe-tundra during the last glacial period 
(from roughly 110,000 to around 12,000 years 
ago), Shapiro begins the journey towards 
de-extinction, and hunts for a well-preserved 
specimen. One bizarre expedition to Sibe- 
ria ends with the juxtaposition of armed 
nomadic reindeer herders and a French couple
with a cooler box filled with cheese. But
cells and DNA degrade rapidly after death, so
it is unlikely that anyone will ever discover an
intact mammoth nucleus that will allow the
species to be rebooted with the cloning technique
of somatic-cell nuclear transfer.

How to Clone a Mammoth: The Science of De-Extinction
BETH SHAPIRO
Princeton Univ. Press: 2015.

Breeding the mammoth back into existence 
through carefully arranged elephant back- 
crosses is also a non-starter, Shapiro shows. 
Elephants are tropical beasts with none of the 
adaptations to low temperatures that would 
need to be concentrated through artificial 
selection. The only realistic route, she con- 
cludes, is to engineer mammoth-like traits 
into an elephant stem cell. As sci-fi as this 
sounds, she predicts that mammoth revival- 
ists will have achieved it within a few years. 

Several hurdles remain, and Shapiro 
presents these clearly and entertainingly as a 
brilliant thought experiment at the bounda- 
ries of biological plausibility. The engineered 
stem cell would need to be cloned, the cloned 
embryo transferred to a surrogate or artifi- 
cial uterus, mammoth-like elephants reared 
in sufficient numbers in captivity, a suit- 
able habitat made available, and the public 
convinced of the benefits of releasing these 
genetically modified organisms. The beasts 
would then need to be set free. 

Given the astronomical odds against 
pulling off all these steps in succession, it 
is remarkable to discover that a respected 
scientist such as Shapiro is actively involved 
in projects to bring the woolly mammoth 
and passenger pigeon (Ectopistes migratorius) 
back from the dead. The idea of enriching 
biological diversity by engineering long-lost 
genetic innovations into extant species is 
certainly worth considering and not so very 
different from how we already engineer crops. 
Unfortunately, the ecological argument for 
mammoth de-extinction is distinctly woolly, 
based more on the appeal of rewilding than 
on clear experimental evidence of the role of 
large herbivores on the Arctic tundra. 

In the case of the passenger pigeon, the 
argument in favour of de-extinction is thin- 
ner still, and involves some seriously bird- 
brained ideas, such as the proposal to paint 
thousands of homing pigeons to look like 
passenger pigeons so that they can train up 
the resurrected birds for release. 

Shapiro acknowledges that there are prob- 
ably better cases for de-extinction, though 
she does not explore any in detail. This is 
a shame. The mammoth and passenger 
pigeon might be the perfect species to illus- 
trate the fraught, often contradictory logic 
of de-extinction, but it is in simpler, more 
tractable settings, where there are fewer 
mammoth leaps of faith, that de-extinction 
might become the “powerful new tool” for 
conservation that Shapiro predicts.


Henry Nicholls is a journalist based in 
London. His latest book is The Galapagos. 
e-mail: henry@henrynicholls.com 
Books in brief


The Upright Thinkers: The Human Journey from Living in Trees to 
Understanding the Cosmos 

Leonard Mlodinow PANTHEON (2015) 

Like Yuval Noah Harari’s Sapiens (Harvill Secker, 2014; see Nature 512, 
369; 2014), this is an audacious encapsulation of our species’ trek 
from savannah to city. But physicist Leonard Mlodinow dwells less on 
our aggression than on our curiosity. As he traces the human brain’s 
evolution over millions of years, the birth of science over hundreds, 
and the modern physics revolution over tens, a tale of hope and glory 
emerges, from the “leap in existential consciousness” at the Neolithic 
site of Göbekli Tepe in Turkey to twentieth-century atomic epiphanies.


The Weather Experiment: The Pioneers who Sought to see the Future 
Peter Moore CHATTO AND WINDUS (2015) 

Prepare for turbulence in this history of Britain’s seminal 
contribution to weather forecasting. Peter Moore reveals how the 
lack of early warning exacerbated the impact of events such as the 
Great Storm of 1703 — which blew cattle over hedges and spun 
windmills so fast that they burst into flames. The nineteenth-century 
arrival of Francis Beaufort (who quantified wind) and chemist Luke 
Howard (who classified clouds) was a turning point. But the star was 
‘Darwin’s captain’ Robert FitzRoy, who founded the UK forecasting 
system, but killed himself before he saw his vision realized. 


Misbehaving: The Making of Behavioural Economics 


Richard H. Thaler ALLEN LANE (2015)

In Nudge (Yale University Press, 2008), co-authored with Cass
Sunstein, behavioural-economics pioneer Richard Thaler revealed
how human irrationality shapes markets. Here he relates the evolution
of those ideas by way of eureka moments and buzzy collaborations.
Puzzling over behaviours that failed to fit established economic
models, he became galvanized by the work of psychologists Daniel
Kahneman and Amos Tversky on heuristics and biases. As Thaler’s
story of research into self-control and other neglected topics unfolds,
his tussles with traditional economists inject plenty of zing.


The Narrow Edge: A Tiny Bird, an Ancient Crab, and an Epic
Journey

Deborah Cramer YALE UNIVERSITY PRESS (2015)

The red knot (Calidris canutus rufa) is a migratory marathoner. The
sandpiper flies nearly 30,000 kilometres per year, from Tierra del
Fuego to the Arctic and back, stopping on the US East Coast to feast
on horseshoe-crab eggs — but on the wing for up to 6,000 kilometres
at a time. With the bird’s food sources and habitats now threatened,
writer Deborah Cramer travels with some of the dedicated
researchers working along its flyway. An eloquent interweaving of
history, field practice and keen personal observation.


The Lost World of the Old Ones: Discoveries in the Ancient 
Southwest 

David Roberts W. W. NORTON (2015)

An enigmatic civilization — the Ancestral Puebloans, or Old Ones — 
once flourished in Arizona, Colorado, New Mexico and Utah. When its 
peoples disappeared in AD 1300, they left behind stunning rock art 
and dwellings sited like swallows’ nests on sheer canyon walls. David 
Roberts braids his own daredevil discoveries of remote sites into 

the stories of Southwestern archaeologists and ranchers who are as 
gripped by this lost culture of “genius climbers” as he is. Barbara Kiser 
Forests: see the trees
and the wood 


Silvano Fares and colleagues’ 
recommendations for 

managing Europe’s forests to 
promote resilience and carbon 
storage (Nature 519, 407-409; 
2015) seem to overlook the 
implications for forests as natural 
ecosystems and run counter 

to biodiversity sustainability 
guidelines (D. B. Lindenmayer 

et al. Biol. Conserv. 131, 433-445; 
2006). These must be taken 

into account if we are to meet 

the Convention on Biological 
Diversity’s Aichi targets and the 
European Union's 2020 target for 
halting biodiversity loss. 

For example, managing a 
forest’s composition of tree 
species in favour of productive, 
gale-resistant and disease- 
tolerant species or genotypes 
will be at the expense of 
hundreds of native woody 
species and their rich biota of 
insects and fungi. Harvesting 
trees at shorter intervals to 
promote carbon storage would 
endanger old forests and veteran 
trees, along with the birds, bats, 
beetles, fungi and lichens they 
support. And removing woody 
debris to prevent wildfires 
would kill the wealth of 
species that thrive on dead and 
decaying wood. 

Hans Henrik Bruun, Jacob 
Heilmann-Clausen University of 
Copenhagen, Denmark. 

Rasmus Ejrnæs Aarhus
University, Denmark. 
hhbruun@bio.ku.dk 


Forests: not just 
timber plantations 


In considering how to develop 
Europe’s forests sustainably

(S. Fares et al. Nature 519, 
407-409; 2015), it is important to 
differentiate between intensively 
managed forestry areas and 
forests as natural habitats. 
Working with, rather than 
against, nature is more likely to 
offer sustainable solutions (see, 
for example, the forest work 


programme of the Convention
on Biological Diversity:
www.cbd.int/forest), especially in view
of the potential effects of climate 
change on European forests. 

One of the targets in 
the European Union (EU) 
biodiversity strategy for 2020 
calls for forestry management 
practices to improve the 
conservation status of species 
and habitats. Current forestry 
practices all too often do just 
the opposite (see, for instance, 
D.B. Lindenmayer et al. Biol. 
Conserv. 131, 433-445; 2006). 

As the EU parliament debates 
a common forest strategy, we 
need to look beyond promoting 
intensive forestry for maximum 
timber yield under the flag of 
climate change. Forestry practices 
that preserve natural ecosystem 
processes are likely to be more 
effective in maintaining forests’ 
biodiversity and natural resilience 
against climate change (see, for 
example, T. Kuuluvainen and 
R. Grenfell Can. J. For. Res. 42, 
1185-1203; 2012). 
Bengt Gunnar Jonsson 
Mid Sweden University, Sundsvall, 
Sweden. 
Guy Pe'er Helmholtz Centre for 
Environmental Research — UFZ, 
Leipzig, Germany. 
Miroslav Svoboda Czech 
University of Life Sciences, Prague, 
Czech Republic. 
bengt-gunnar.jonsson@miun.se 


Research network to 
track alpine water 


The water cycle in alpine 
environments worldwide 
supplies fresh water to vast 
downstream areas inhabited by 
more than half of humanity. The 
International Network for Alpine 
Research Catchment Hydrology 
(INARCH) was launched this 
year by the Global Energy and 
Water Exchanges project of 
the World Climate Research 
Programme, to improve the 
predictability of water resources 
from mountainous regions (see 
www.usask.ca/inarch). 

Melting snow and ice are 
altering hydrological systems and
affecting the quantity and quality 
of water resources, as indicated in 
the 2014 Working Group II report 
by the Intergovernmental Panel 
on Climate Change. Insight into 
these processes has been limited 
by the difficulties of collecting 
data on the alpine water cycle in 
high-mountain research areas. 
Studies have been concentrated 
at few sites, with poor data 
comparison across mountainous 
regions. 

INARCH’s alpine catchments 
are well equipped to measure 
snow and ice hydrology. Such 
information can be used, for 
example, to investigate the 
effects of mountain snowpacks 
and glaciers on water supply, 
or to study variations in energy 
and water exchange in different 
high-altitude regions. The 
network will help to develop 
new observational methods, a 
measurement infrastructure and 
an international database to guide 
research and policy. 

John Pomeroy University of 
Saskatchewan, Canada. 
Matthias Bernhardt BOKU 
University of Natural Resources 
and Life Sciences, Vienna, Austria. 
Daniel Marks US Department of 
Agriculture Agricultural Research 
Service, Boise, Idaho, USA. 
matthias.bernhardt@boku.ac.at 


Botanists still need 
to tell plants apart 


I disagree with Isabel Marques’ 
frightening suggestion that 
botanical education should be 
‘modernized’ by moving away 
from the study of specimens 
towards a more molecular 
approach (Nature 520, 295; 
2015). Classical botanical 
disciplines such as systematics 
and taxonomy fail to attract 
students not because they lack 
molecular allure, but because of 
the time-consuming focus they 
demand. 

In any case, many botany 
students at universities worldwide 
are also trained in molecular 
biology, as at my institute, the 
University of Pretoria in South
Africa, and at the Ruhr University 
Bochum in Germany, for 
instance. Even ‘old-fashioned’ 
studies in the field of plant 
systematics rarely get published 
without molecular data, and these 
often include genomic data. 
Drawing botanical education 
away from the plant itself risks 
creating ‘experts’ who cannot 
reliably differentiate species. 
Marques’ call for outreach to help
the public to identify common 
plants would then not stand a 
chance. 
Martin Kemler University of 
Pretoria, South Africa. 
martin.kemler@fabi.up.ac.za 


Interim initiative for 
health in Iran 


As physicians working in Iran, we 
agree that the lifting of economic 
sanctions cannot come soon 
enough for the nation’s health 
(M. Aloosh and A. Aloosh Nature 
520, 623; 2015). In the interim, a 
group dedicated to restoring Iran's 
medical science, public health and 
health diplomacy is launching a 
non-profit organization — the 
Iranian-American Partnership 
for Medical Science and Public 
Health Association (IAPA). 
The partnership is based in 
Switzerland and comprises US 
and Iranian volunteers (including 
myself). 

One of IAPA’s first priorities
is to seek relief for young cancer 
patients in charity hospitals 
where chemotherapy drugs are 
in desperately short supply (see 
S. Shahabi Nature 520, 157; 2015). 
A channel for such humanitarian 
support was established in 
November 2013 by the Joint Plan 
of Action between Iran and six 
other countries, in part to address 
the unwanted consequences 
of international sanctions (see 
go.nature.com/nr1sfn). 
Ali Akbar Velayati, Hamidreza 
Jamaati, Seyed Mohammadreza 
Hashemian Shahid Beheshti 
University of Medical Sciences, 
Tehran, Iran. 
iran.criticalcare@yahoo.com
EVOLUTION


Beauty varies with the light 


Experimental work on guppies suggests that variation in light between microhabitats is what makes females prefer 
different male signal combinations, thus explaining the evolution and persistence of colour variation in males. 


OLE SEEHAUSEN 


Males of many animal species display
conspicuous signals as part of their

courtship — bright colours and 
enlarged fins in fishes or colours and feather 
plumes in birds are familiar examples. Yet
these displays cost energy and expose males to
predation, so they will, on average, shorten the
males’ life. Charles Darwin identified this evolutionary
paradox, and evolutionary biologist
Ronald Fisher developed a model to explain it,
arguing that if slight variation in a male trait
that draws female attention initially coincides
with variation in male vigour, this might trigger
‘runaway selection’. Here, by means of a
genetic correlation, both the male ornament
and the female preference become exaggerated
beyond the natural-selection optimum until
natural selection and the benefit bestowed by
sexual selection balance out. But this theory
did not explain why considerable variation in
male traits, which is widespread in nature,
should remain, because directional selection
would be expected to deplete heritable variation.
Writing in American Naturalist, Cole and
Endler suggest that environmentally dependent
variation in signal transmission may be a
contributing factor. 

The most intuitive explanation for the 
maintenance of conspicuous variation in traits 
under strong sexual selection is that trade-offs 
between natural and sexual selection allow for 
alternative ways to optimize fitness. However, 
surprisingly, this idea has rarely been substantiated.
More-empirical support has accumulated
for the hypothesis that expression
of ‘expensive-to-make’ or ‘expensive-to-have’
signals depends on the heritable condition
of the bearer. This theory may explain why
variation persists in the face of strong selection
either when many genes contribute to
male condition, creating a large mutational
target for sexual selection (the idea of ‘genic
capture’), or when the relative costliness of
different signals varies between different
genotypes or environments.

Cole and Endler propose a third mecha- 
nism, namely, that signal transmission and 
signal selection vary on very small spatial 
scales that affect individuals within a 
single population. The efficacy of animal and 
Figure 1 | Spatial scale of light environments affects the outcome of sexual selection. Cole and Endler
show differing female preferences for male coloration patterns in different light environments. a, In 
natural guppy habitats — shallow waters with a fine-grained, mosaic light — this may contribute to the 
evolution and maintenance of colour variation within the male guppy population. b, In fish that occupy 
greater ranges of water depth, with defined light gradients, this effect may contribute to population 
divergence and speciation, such as is seen between different depths in cichlid fishes.


plant signals is a function of their transmission 
through the environment, their contrast against 
the environmental background and variables 
on the signal-receiver side. The appearance
and perception of colour signals, for instance, 
depend heavily on the light in the surrounding 
environment, and many animal populations 
occupy heterogeneous environments in which 
light conditions vary. The most effective signal 
design may then also vary between individu- 
als depending on where and when they signal. 
Divergence in male signals between populations
occupying either opposite ends of a light
gradient or discrete habitats that differ in light
conditions has been shown in other fishes,
and this can be associated with species divergence,
but these cases were also accompanied
by divergence in the fishes’ visual systems.
By controlling for visual-system variation, Cole 
and Endler were able to measure how sexual 
selection on male colour pattern changes as a 
function of changes in the environment alone. 
The authors studied guppies, small fish
that are classic study objects in the evolutionary
ecology of mate choice and sexual selection.
Guppy males are incredibly variable in their 
coloration and patterns, with males displaying 
markedly different combinations of chromatic 
(including orange, red, green or blue) and ach- 
romatic (silver, black or white) colours. Guppy 
females are plain yellow-grey for camouflage, 
and vary in their mating preferences for different
male colour combinations.

By observing preferences for males in the 
same population of guppy females in water of 
four different colours, Cole and Endler esti- 
mated the attractiveness of alternative male 
patterns as a function of the environment. 
Rather than preferring individual colours, 
females exhibited preferences for groups of 
colours and for a different colour group in 
each environment. Female guppies seem to 
select entire colour patterns on the basis of 
overall visual contrast against the background. 


This implies that individuals or populations 
inhabiting different environments will be sub- 
ject to sexual selection on male colour pattern 
in different directions, without any evolved 
change in female preferences. 

The authors also report that the effectiveness 
of achromatic components of colour pattern 
changed much less with light environment than 
did that of chromatic colours. Although per- 
haps unsurprising, this observation allowed the 
authors to hypothesize that chromatic and ach- 
romatic colours evolve in qualitatively different 
ways, with the latter representing some form 
of contingency against environmental change. 
Species living in a range of light environments 
(generalists) would be more likely to evolve 
achromatic colours for signalling, whereas 
species that specialize on certain microhabi- 
tats would combine chromatic and achro- 
matic components. In both of these groups, 
achromatic colour elements might function 
in aspects of communication that must work 
across multiple light environments — for 
example, the light differs on the breeding and 
feeding grounds of many fishes. Consequently, 
the authors predict that the evolution of chro- 
matic and achromatic signals would show 
different levels of evolutionary conservation. 

These are interesting ideas for how micro- 
evolutionary processes (those within a popu- 
lation) allow predictions to be made about 
macroevolutionary (between-species) pat- 
terns. Although testing such predictions in 
guppies may be difficult owing to the absence 
of macroevolutionary radiations in this lineage,
support comes from work on other
fishes. In African cichlid fishes, of which there 
are large species radiations, chromatic colours 
tend to vary between closely related species, 
whereas arrangements of black stripes and 
bars are more strictly conserved (Fig. 1). And
evolutionary diversification into many spe- 
cies that have different chromatic male court- 
ship colours is associated with microhabitat 
specialization, whereas such diversification is 
not typically seen in habitat generalists. 

The comparison between guppies and cichlids 
is interesting from yet other perspectives. 
Guppies are one of the few animals in which 
the genetic correlation between female preference
and male trait, theoretically predicted by
Fisher and by most models of sexual selection,
has been demonstrated. One would expect
this co-evolution between the sexes to lead, 
at least sometimes, to behaviourally isolated 
species, but this has almost never happened 
in guppies. Ecological constraints and continued
gene flow as a result of male coercion
are two possible explanations, but implicit in 
the work of Cole and Endler is a third one: 
that the co-evolutionary mosaic between male 
colour and female preference is often medi- 
ated by environmental variation in light on so 
small a spatial scale that it facilitates individual 
variation within populations but no divergence 
between populations (Fig. 1). Guppies live in 


shallow waters of forest streams, where bright 
spots of sunlight alternate with canopy shade, 
which is depleted in red and blue light but rich 
in green and yellow. Individual guppies move 
about in this mosaic of light, and this facilitates 
the evolution of large colour variation between 
individuals within a population (called colour 
polymorphism) that is associated with indi- 
vidual light preferences. Cichlid populations 
in African lakes experience variation in light 
on a very different spatial scale, mediated by
water depth, and these long and broad light 
gradients facilitate the divergence of entire 
populations across depths, eventually leading to 
speciation. It seems that it is the relationship
between the spatial scales of light heterogen- 
eity and individual movement that determines 
whether sexual selection will maintain sexual- 
signal variation within populations or drive 
loss of variation within populations, divergence 
between populations and speciation.
CANCER
Antibodies regulate 
antitumour immunity 


Boosting the T cells that mediate anticancer immune responses is a therapeutic 
goal. But T cells do not work alone — B cells and the antibodies they produce can 
both trigger and suppress the response. SEE LETTERS P.94 & P.99 


LAURENCE ZITVOGEL & GUIDO KROEMER 


Although cancers originate from cell-intrinsic
genetic mutations, it has
become clear that for malignant
tumours to progress in an unrestrained fashion
they must elude or subvert the host’s immune
system. As a result, the best, and perhaps the
only, option for curing cancer lies in enhan- 
cing these immunosurveillance mechanisms. 
This can be achieved by immunotherapies
(treatments specifically designed to stimulate
anticancer immunity) and through chemotherapies
and radiotherapies that mediate their
long-term effects by eliciting anticancer immunity.
Natural anticancer immune responses,
as well as many forms of cancer immuno- 
therapy, depend on the activity of T cells that 
recognize tumour-specific molecules (antigens);
only rarely have anticancer effects
been ascribed to the antibody-producing
B cells of the humoral immune system.
But in this issue, Carmi et al. (page 99) and
Shalapour et al. (page 94) reveal the potential
of humoral immune responses to positively
and negatively regulate T-cell-based anticancer 
immunity. 

Animal studies have revealed that tumours 
from one individual typically do not grow 
following transplantation into another indi- 
vidual of the same species, in much the same 
way that transplanted organs are rejected by 
the recipient’s immune system. Carmi et al.
reasoned that understanding this tumour 
rejection might provide clues to harnessing 
the natural antitumour immune response. 
They studied the reaction to tumours trans- 
planted from mice of one strain into a strain 
that had the same genes encoding the major 
histocompatibility complex (MHC) mol- 
ecules, which are the strongest determinants 
of transplant rejection, but that were otherwise 
genetically distinct. Rejection of these allo- 
geneic tumours was triggered by antibodies 
that bind to tumour antigens through the anti- 
body’s ‘variable’ region and interact with the 
Fcγ receptor of dendritic cells (DCs) through
its ‘constant’ region. The DCs then engulf 
Figure 1 | Two opposing roles for B cells in cancer immunology. a, Carmi et al. show that B cells
secreting natural allogeneic antibodies (those produced in a genetically different individual) of the IgG
class can stimulate a host’s immune response against cancer cells. Antibodies recognizing surface molecules
(antigens) on tumour cells can transfer these antigens to dendritic cells (DCs), whose Fcγ receptor binds
to the antibody’s constant region. If the DCs are activated by adjuvant signals, they can then present
these tumour antigens (attached to surface MHC molecules) to tumour-specific CD8⁺ T cells, inducing
their cell-killing activity. b, Shalapour et al. show a different role for B cells during oxaliplatin-based
chemotherapy. The drug induces the intratumoral expression of transforming growth factor β (TGF-β),
which causes B cells to switch to producing IgA antibodies. These IgA-producing B cells secrete the broadly
immunosuppressive molecule interleukin-10 (IL-10) and express PD-L1, the ligand for the T-cell receptor
PD-1. Through these functions, the B cells inhibit T-cell activity that would otherwise be stimulated by the
combination of DCs presenting antigens from dying tumours and intrinsic adjuvant signals.


portions of the tumour cells, extract their 
antigens and present them to specific T cells 
(of the cytotoxic CD8⁺ subset), which eventually
activates the T cells to kill the cancer cells
(Fig. 1a). These tumour-specific antibodies 
were ‘natural’ antibodies that were produced 
by the mice without any immunization. 

The authors then used several experimen- 
tal approaches to show that natural allogeneic 
antibodies of the immunoglobulin G (IgG) 
class induce potent T-cell-mediated anti- 
cancer immune responses. First, systemic 
injection of natural IgG antibodies prevented 
the engraftment of tumour cells transplanted 
between mice that were genetically identical to 
one another (syngeneic) but distinct from the 
mice from which the antibodies were purified 
(allogeneic). Second, when tumour cells were 
coated with natural allogeneic IgG antibodies 
and incubated with syngeneic DCs ex vivo, the 
DCs extracted tumour antigens from the cells, 
and mice immunized with these DCs were 
protected against tumour formation follow- 
ing tumour-cell transplantation. Third, injec- 
tion of allogeneic IgG antibodies directly into 
established mouse tumours (breast, pancreatic 
and lung tumours, and melanomas), in combi- 
nation with adjuvant compounds that activate 
tumour-associated DCs, induced an immune
response that eradicated the tumours. Remark- 
ably, this latter protocol not only removed the 
primary tumour but also eradicated second- 
ary (metastatic) tumours in distant organs. 
The authors also provide evidence suggesting 
that the anticancer effects induced by alloge- 
neic IgG antibodies are ultimately mediated by 
CD8⁺ T cells targeting tumour antigens.

It remains to be determined whether this 
approach could be used to treat human can- 
cers. Such attempts would involve pooling 
IgG antibodies from healthy individuals for 
direct injection into the tumours of patients, 
together with adjuvants. This principle could 
first be assessed in tumour explants incubated 
with allogeneic IgG antibodies from different 
individuals. 

However, the contribution of B cells to 
anticancer immunity is not always positive. 
Shalapour et al. provide evidence that a subpopulation
of B cells suppresses the activation
of tumour-targeting CD8⁺ T cells in prostate
cancers treated with the drug oxaliplatin 
(Fig. 1b). Oxaliplatin is a chemotherapeutic 
agent that not only kills cancer cells but also 
stimulates antitumour immune responses, 
inducing cancer-cell stress and death in a way 
that allows DCs to take up the tumour antigens
and present them to tumour-specific CD8⁺
T cells.

Shalapour and colleagues find that oxali- 
platin induces both T cells and B cells to 
infiltrate the tumour, and that deleting B cells 
improves T-cell infiltration, thereby improving 
the effectiveness of the treatment at retarding 
tumour growth. They show that the B cells 
responsible for these effects express IgA anti- 
bodies and produce the immunosuppressive 
signalling molecule interleukin-10. After oxali- 
platin treatment, these B cells also express the 
surface molecule PD-L1, the ligand of the 
PD-1 receptor that is expressed on T cells and 
can paralyse their function if bound. 

Furthermore, oxaliplatin induced the 
expression of transforming growth factor β
(TGF-β) in the tumour, which was required
to attract the B cells. The authors show that
mice lacking TGF-β receptor 2 in their B cells,
or mice unable to secrete IgA antibodies, had 
reduced expression of PD-L1 on intratumoral 
B cells following oxaliplatin treatment, and 
that these mice responded better to the treat- 
ment than wild-type mice, showing a greater 
reduction in tumour size. Inhibiting PD-L1 
with blocking antibodies in wild-type mice 
after oxaliplatin treatment had a similar 
beneficial effect. 

The findings in these two papers underscore 
the ambiguous role of B cells and their prod- 
ucts in tumour immunology. On the one hand, 
IgG antibodies that recognize tumour antigens 
can trigger the crucial first step of a cellular 
immune response: the Fcγ-mediated uptake
of tumour antigens by DCs. Whether such 
antibodies can mediate additional anticancer 
functions, for instance by antibody-dependent 
cellular cytotoxicity, remains to be explored.
But on the other hand, B cells that infiltrate 
the tumour and produce IgA can exert local 
immunosuppression. At this stage, it is not 
clear whether the IgA antibodies secreted by 
these cells must specifically recognize tumour 
antigens to subvert cellular immunity. 

Regardless of the unknowns, and the possi- 
bility that these findings are highly dependent 
on cancer type, these newly described roles 
of B cells in anticancer immunosurveillance 
present a promising frontier for therapeutic 
exploitation. Further opportunities to stimu- 
late immune defences against malignant cells 
will arise as we learn more about the role of 
humoral immunity in cancer, including how 
tumour antigens are recognized by antibod- 
ies, the regulation of antibody class switching 
(to and from IgG and IgA isotypes), and the 
functional characteristics of distinct B-cell 
subpopulations.
Nanoscale connections
for brain-like circuits 


Tiny circuit elements called memristors have been used as connections in an 
artificial neural network — enabling the system to learn to recognize letters of 
the alphabet from imperfect images. SEE LETTER P.61 


ROBERT LEGENSTEIN 


The human brain is a network of billions
of neurons that communicate through
some 10¹⁵ synaptic connections. Our
cognitive abilities result from computations 
performed in this vast network, which is shaped 
by experience as learning drives changes in 
the strengths of synaptic coupling. Synthetic 
neuromorphic circuits use the same massively 
parallel architecture in complementary metal- 
oxide-semiconductor (CMOS) technology, 
which underpins much of the circuitry in con- 
ventional computers. But designing neuro- 
morphic chips that approach the connectivity 
of the human brain remains challenging. On 
page 61 of this issue, Prezioso et al. report a
major advance in the field — an 
artificial neural network that 
learns to solve a visual-recogni- 
tion task on the basis of artificial 
synapses formed from devices 
called memristors. 

Nearly all contemporary com- 
putational devices are based on 
a design known as the von Neu- 
mann architecture. The Achilles 
heel of this incredibly success- 
ful approach is the separation 
of computation and memory: 
although data are manipulated in 
the central processing unit, they 
are stored in a separate random- 
access memory. Any operation 
therefore involves the transfer 
of data between these compo- 
nents. Known as the von Neumann
bottleneck, this renders
the computation inefficient. 

An alternative model is offered 
by the architecture of the brain, 
in which computation and mem- 
ory are highly intermingled. The 
‘program’ — which includes 


previously observed data and memories — is 
stored in the strengths of synaptic con- 
nections directly adjacent to the neuronal 
processing units. Derivatives of this archi- 
tecture, known as artificial neural networks, 
have been investigated since the inception of 
computer science.

Artificial neural networks are not pro- 
grammed like conventional computers. 
Just as humans learn from experience, they 
acquire their function from data during a 
training phase. Human-like performance has 
recently been obtained for several tasks by
using huge data sets to train large networks 
containing hundreds of millions of connec- 
tions. This research has further fuelled inter- 
est in brain-inspired neuromorphic hardware 


Figure 1 | A memristive neural network. The cartoon depicts a fragment of
Prezioso and colleagues’ artificial neural network, which consists of crossing
horizontal and vertical wires that have memristor devices (yellow) at the
junctions. Input voltages V₁ to V₃ (the network inputs) drive currents through
the memristors, and these currents are summed up in the vertical wires. Artificial
neurons (triangles) process the difference between currents in neighbouring
wires to produce outputs f₁ and f₂. The plus and minus symbols on the neurons
indicate that the output depends on current differences.
that emulates neuronal computation more
directly than conventional hardware in a mas- 
sively parallel design. But communication 
between emulated neurons is a crucial factor, 
and so most of the chip area and power usage 
of neuromorphic hardware are inefficiently 
consumed by CMOS circuits that act as arti- 
ficial synapses. 

Memristors seem to offer an ideal solution 
to this problem. These devices are resistors
that have an analog memory conceptually sim- 
ilar to that of biological synapses. Memristor 
arrays can be fabricated at extremely high den- 
sity, operate at ultra-low power, and capture 
key aspects of biological synaptic plasticity (the 
ability of synaptic connections to strengthen 
or weaken as a function of the connected 
neurons’ activity). But using memristors as
artificial synapses has proved difficult because 
of high device-to-device variability — even 
when two devices are fabricated with identi- 
cal parameters, their actual behaviour can be 
quite different. 

Enter Prezioso and colleagues. They have 
fabricated a memristive crossbar array con- 
sisting of a grid formed by 12 horizontal and 
12 vertical metal wires, with connectors made 
of layers of aluminium oxide and titanium 
dioxide between the wires at the crosspoints. 
A memristor is thereby formed at every inter- 
section of wires. The authors used this array 
to implement artificial synaptic 
connections for a simple neural 
network (Fig. 1). 
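
The read-out principle in Fig. 1 can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions: the memristors are treated as ideal, noise-free conductances obeying Ohm's law, any neuronal nonlinearity is omitted, and every name and numerical value is hypothetical rather than taken from Prezioso and colleagues' device.

```python
# Illustrative sketch only: idealized memristors that obey Ohm's law.
# Function names and all numbers are hypothetical, not from the paper.

def column_currents(voltages, conductances):
    """Sum the currents flowing into each vertical wire.

    voltages[i]        : input voltage on horizontal wire i (volts)
    conductances[i][j] : conductance of the memristor joining horizontal
                         wire i to vertical wire j (siemens)
    """
    n_cols = len(conductances[0])
    return [
        sum(v * row[j] for v, row in zip(voltages, conductances))
        for j in range(n_cols)
    ]


def neuron_inputs(currents):
    """Difference between the currents in each pair of neighbouring wires."""
    return [currents[k] - currents[k + 1] for k in range(0, len(currents) - 1, 2)]


# Three inputs and four vertical wires, i.e. two differential pairs.
voltages = [0.1, -0.1, 0.1]
conductances = [
    [2e-5, 1e-5, 1e-5, 3e-5],
    [1e-5, 1e-5, 2e-5, 1e-5],
    [3e-5, 1e-5, 1e-5, 1e-5],
]

currents = column_currents(voltages, conductances)
outputs = neuron_inputs(currents)
print(currents, outputs)
```

The point of the sketch is that the whole weighted sum is computed by the wiring itself: each vertical wire adds up voltage times conductance for all of its junctions at once, so no data need to be moved between a memory and a processor.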

Prezioso et al. used junction 
parameters (such as the layer 
thicknesses) that they had previ- 
ously determined in exhaustive 
tests to minimize memristor vari- 
ability. This allowed the authors 
to produce a crossbar without 
the need for additional transis- 
tors at crosspoints to compen- 
sate for variability: avoiding the 
use of compensatory transistors 
is a prerequisite for high network 
connectivity. 

Because neural networks adapt 
their synaptic strengths during 
training, it is important that the 
conductance of memristors can 
also change during operation. 
Prezioso and colleagues dem- 
onstrated that their system has 
this capability in an experiment 
in which the network quickly 
learned to report which letter 
of the alphabet was shown in 
a ‘noisy’ image. Although a simple task, this
achievement is remarkable because the con- 
tinuous conductance changes of memristors 
are notoriously noisy and non-symmetric 
— that is, increases in conductances often 
have different amplitudes from analogous 
decreases, which causes problems for learning 
algorithms. The authors used two memristors 
for each synaptic connection, such that the 
strength of a synapse was given by the differ- 
ence between two memristor conductances. 
This differential implementation of synaptic 
strengths has several benefits — for exam- 
ple, it reduces the impact of non-symmetric 
conductance changes because each synaptic 
update involves the change of two conduct- 
ances in opposite directions. 
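
A toy calculation shows why the differential scheme helps. The step sizes below are hypothetical and chosen only to make increases and decreases unequal; real devices are also noisy and bounded, which this sketch ignores. The function names are illustrative, not taken from the paper.

```python
# Hypothetical step sizes (arbitrary units): a conductance increase and an
# analogous decrease have different amplitudes, as described in the text.
STEP_UP = 0.10
STEP_DOWN = 0.06

def single_update(g, direction):
    """One-memristor synapse: the weight is the conductance itself."""
    return g + STEP_UP if direction > 0 else g - STEP_DOWN

def differential_update(g_plus, g_minus, direction):
    """Two-memristor synapse: each update moves the pair in opposite directions."""
    if direction > 0:
        return g_plus + STEP_UP, g_minus - STEP_DOWN
    return g_plus - STEP_DOWN, g_minus + STEP_UP

def weight(g_plus, g_minus):
    """Synaptic strength as the difference between the two conductances."""
    return g_plus - g_minus

if __name__ == "__main__":
    print("single device steps:",
          single_update(0.5, +1) - 0.5, single_update(0.5, -1) - 0.5)
    print("differential steps:",
          weight(*differential_update(0.5, 0.5, +1)),
          weight(*differential_update(0.5, 0.5, -1)))
```

With one device, a strengthening step and a weakening step differ in size. With the pair, every update combines one increase and one decrease, so the weight moves by the same amount in either direction.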

Prezioso and co-workers’ result is a proof-of-concept
for hybrid CMOS-memristor
neuromorphic circuits. If this design can be 
scaled up to large network sizes, it will affect 
the future of computing. Computer scientists 
have struggled to design algorithms for jobs 
that humans perform easily, such as visual 
tasks (distinguishing objects in a scene, for 
example), speech recognition and coordina- 
ting muscles and limbs to perform a motor 
task. Large neural networks can learn such 
tasks from massive data sets. Brain-inspired
hardware would therefore complement the 
strengths of conventional computers. In the 
future, laptops, mobile phones and robots 
could include ultra-low-power neuromorphic 
chips that process visual, auditory and other 
types of sensory information. 

Of course, more research is necessary 
to achieve these goals. With an area of 
200 × 200 nanometres, the memristive devices
used by Prezioso et al. are still relatively large 
compared with other state-of-the-art memris- 
tors, and the network described is quite simple. 
Much larger networks will need to be created, 
with higher numbers of memristors per unit 
area, for applications to be realized. Also, the 
researchers used a batch-learning set-up, in 
which the whole training data set had to be 
processed for each update of memristive con- 
ductances. This training set-up would there- 
fore require many extra circuit components 
to provide large amounts of memory outside 
the memristive crossbar array. Future research 
must explore how efficient learning procedures 
for memristive crossbar arrays can be achieved 
without the need for external memory.

Tasteless pesticides
affect bees in the field 


Two studies provide evidence that bees cannot taste or avoid neonicotinoid 
pesticides, and that exposure to treated crops affects reproduction in solitary bees 
as well as bumblebee colony growth and reproduction. SEE LETTERS P.74 & P.77 


NIGEL E. RAINE & RICHARD J. GILL 


Insects such as bees are crucial for the
pollination of agricultural crops and wild
plants, helping to ensure food security
and maintain biodiversity. Yet a range of
environmental stressors are threatening bee
populations around the world. The impact
of pesticide exposure, particularly from
neonicotinoid insecticides, has received substantial
recent research attention and has
become a topic of public debate. Studies that
have reported adverse effects of neonicotinoids
on bees have been criticized for several reasons:
that exposure tests are carried out under
laboratory or semi-field settings rather than
in the field and use pesticide-treated foods
containing unrealistically high dosages; and
that bees can detect chemical residues on
treated crops and avoid foraging on them. Further
weight has been added to such criticisms
because the few field studies that have investigated
potential impacts on honeybees and
bumblebees from exposure to neonicotinoid-treated
crops have been interpreted to show
little or no effect, although limitations to
these studies have been highlighted. Two
studies in this issue strike at the heart of these
evidence gaps and improve our understanding
of pesticide exposure risks to bees.

In their paper, Kessler et al. (page 74)
present a carefully controlled laboratory study
testing the ability of both honeybees (Apis
mellifera) and bumblebees (Bombus terrestris)
to taste the three most commonly used neonicotinoids
— clothianidin, imidacloprid and
thiamethoxam. When hungry worker bees
could choose to collect from feeders containing
either a solution of neonicotinoid-treated
sugar water or an untreated solution, neither
species avoided the treated food, which contained
neonicotinoid concentrations comparable
to those found in the nectar and pollen of
treated crops. Surprisingly, the bees in fact preferred
the treated solution in the imidacloprid
and thiamethoxam tests, which the authors
suggest arises from the pharmacological action
of these insecticides on receptors in the bees’
brains. The authors corroborated their behavioural
results with neurophysiological measurements
showing that bees are unable to taste
neonicotinoids in sugar water.

Figure 1 | Bee foraging options and pesticide exposure. Non-flowering crops and pasture cover large
areas of rural land, but typically provide limited food resources for bee populations. Flowering crops can
provide plentiful (although non-diverse) bee food, but are often treated with pesticides, a direct route for
exposure. Flower-rich meadows provide a diverse bee diet, but these are becoming increasingly scarce, and
small areas may support only low bee numbers. Furthermore, wild flowers in field margins may contain
pesticide residues. Rundlöf et al. show that the growth rates and reproduction of bumblebee colonies are
lower in neonicotinoid-treated fields than in control fields, and that reproduction of solitary bees can also
be affected. However, the authors found no effect on honeybee colonies. These differences may result from
different ecologies: honeybees can forage many kilometres from their hive, whereas bumblebees roam
over smaller areas, and solitary bees fly less far from their nest. Honeybees also use the waggle dance to
communicate the location of rewarding flower patches to nest-mates. Thus, honeybees may have reduced
pesticide exposure from visiting a greater mixture of foraging sources or through a greater chance of
avoiding treated crops. However, Kessler et al. show that neither honeybees nor bumblebees can taste
neonicotinoids, suggesting that such avoidance behaviour is unlikely. (Nest sites, foraging ranges and the
relative proportion of habitat types vary across landscapes — those depicted are representative only.)

Scaling up from the laboratory, Rundlöf
et al. (page 77) undertook an ambitious study
to assess the impacts of neonicotinoid exposure 
on bees placed near fields of treated oilseed rape 
(also known as canola). The experiment — the 
largest of its kind so far — involved 16 fields 
across southern Sweden: 8 fields were planted 
with seeds treated with the systemic insecti- 
cide clothianidin, the pyrethroid insecticide 
β-cyfluthrin and the fungicide thiram, and
8 control fields were treated solely with thiram. 
Like Kessler et al., these researchers studied 
both honeybees and bumblebees, but followed 
entire colonies rather than individuals. 
Furthermore, they monitored nests of a 
species of solitary bee (Osmia bicornis), as well 
as surveying wild bees in field margins. 

In treated fields, Rundlöf and colleagues
found fewer wild bees and observed reduced 
growth rate and reproduction of bumblebee 
colonies (which produced fewer males and 
fewer new queens — consistent with previous 
semi-field and field studies) compared
to control fields. They also found that none 
of the solitary bees that emerged from nests 
placed next to treated fields came back to their 
natal nest to build new brood cells, whereas 
emergent females successfully produced brood 
cells in six of eight untreated fields. By contrast, 
there was no significant difference in honey- 
bee colony growth between treated and control 
fields. However, the authors’ power analysis 
indicated that they would only have been able 
to detect a minimum effect size of about 19% 
for honeybees. 

These studies provide timely data to address 
calls for further evidence about the environ- 
mental risks of neonicotinoids. The insecti- 
cides tested by the authors are currently subject 
to a European Union moratorium for use as 
seed treatments on crops attractive to bees, but 
this usage restriction will be reviewed before 
December 2015. It is hard to say whether 
the preferences observed by Kessler and col- 
leagues for nectar containing imidacloprid 
and thiamethoxam residues would occur in
a more complex field setting, where many
variables could interfere with foraging deci- 
sions. However, their study does imply that 
foraging bees are unlikely to avoid seed-treated 
crops in the field, and supports previous 
reports of honeybees and bumblebees bringing 
back nectar and pollen from treated fields.
If the preference for treated food does apply in 
the field, these findings suggest that we could 
be underestimating the exposure risk to bees 
from treated crops. 

Both studies also highlight the fact that 
different bee species vary in their responses to 
exposure. Current pesticide registrations rely 
on ecotoxicological testing of just one spe- 
cies, the honeybee, when assessing risks for all 
insect pollinators. Yet Rundlöf and colleagues
found negative effects of neonicotinoids on 
solitary bees and bumblebees in the field, but 
not on honeybees, suggesting that a single 
species might not represent the responses of 
other pollinators. Potential explanations for 
these apparent differences could include a vari- 
able affinity of neuronal receptors for binding 
neonicotinoids; differences in detoxification 
capacities; and divergent foraging behaviours, 
which influence levels of exposure (Fig. 1). 
Differences could also result from variation in 
social organization and life-history strategies. 
Even the smallest perennial honeybee colonies 
contain a queen and several thousand work- 
ers that overwinter as a group, whereas annual 
bumblebee colonies rarely contain more than 
a queen and a few hundred workers. Each 
solitary bee is responsible for its own forag- 
ing and reproduction during its few weeks of 
adult life. The sheer number of workers in the 
honeybee colony may better enable buffering 
of stress over long periods, whereas the more 
severe pinch points that bumblebees and soli- 
tary bees experience could render them more 
susceptible to environmental pressures.

If field experiments to assess exposure are 
deemed so important, why have so few been 
carried out? Limiting factors include the 
scale of such studies, the levels of replication 
required to achieve appropriate statistical 
power, and human and budgetary resources. 
Even with 16 fields, Rundlöf and colleagues’
study had relatively low statistical power and, 
as with other field studies, many environmen- 
tal factors probably varied among their sites 
and could not be standardized. Such studies 
can provide only correlational evidence of 
impacts, whereas controlled-exposure studies, 
such as that of Kessler et al., are better suited
to determining causative relationships through 
manipulative experimentation. The comple- 
mentarity of these two approaches needs to be 
considered by policy-makers and for future 
research planning. 

Although the two latest studies contribute 
to our understanding of the risk neonicoti- 
noids pose to bees, knowledge gaps remain. 
For example, we need further evidence about 
how neonicotinoid exposure might affect
social bee colonies over multiple seasons, how
soil residues might affect ground-nesting bees
and how neonicotinoid exposure interacts with
other environmental stressors. We also need a
greater understanding of how neonicotinoids
affect other pollinators and natural enemies
of crop pests, and of the persistence of these
chemicals in soil and their take-up by untreated
plants growing in or next to treated fields.

50 Years Ago

“The sign of the constant of
gravitation.” By Prof. W. H. McCrea
— A speaker in a recent broadcast
asserted that, were the gravitation-constant
negative instead of
positive, Newton's apple would have
soared away into the sky instead of
falling on Newton's head. However,
had that happened, Newton also
would have soared away and there
would have been no legend to
record. In fact, there cannot be a
world for which gravitation is not
attractive ... We shall see that the
sign of the gravitation constant is
essentially a matter of convention.
From Nature 8 May 1965

100 Years Ago

It may be remembered that the
Royal Commission on Whisky,
which in 1908-9 gave a lengthy
consideration to the matter, did not
find a very satisfactory answer to
the query “What is whisky?” The
Government of Western Australia
... issued regulations under which
certain chemical standards for “pure
pot-still whisky” were proposed for
adoption. The proposals met with
some criticism. It was alleged, in
fact, that many pot-stills employed
in Great Britain could not produce
whisky which would comply with
the requirements ... the proposals,
as now modified ... are that, as
regards Scotch whisky, it shall have
been distilled at a strength not more
than 35 degrees above proof and
matured in wood for not less than
two years; and that “standard pot-still
whisky” shall contain at least
45 grams of esters, 3.5 of furfural,
and 180 of higher alcohols per
100 litres of absolute alcohol ... For
Irish whisky no furfural standard
is proposed at present, but the
proportion of esters is required to be
not less than 35 grams, and of higher
alcohols 200 grams, per 100 litres of
absolute alcohol.
From Nature 6 May 1915

Fundamentally, we must move towards 
finding the right balance between the risks 
of neonicotinoid exposure for insect pollina- 
tors and the value these pesticides provide to 
ensure crop yield and quality. Selective use of 
neonicotinoid seed treatments, on the basis 
of a demonstrable need for systemic pest 
protection, might help to reduce non-target 
exposure and slow the onset of pest resist- 
ance. We also need to consider and evaluate 
alternative options for pest control. It would 
be unfortunate if the recent focus on the risks 
from neonicotinoids led unintentionally to 
broader use of alternative pesticides that prove 
to be even more harmful to insect pollinators 
and the essential ecosystem services that they 
provide. SEE BOOKS & ARTS P.29

Dinosaur up in the air


A new feathered dinosaur from China, belonging to an obscure and strange 
carnivorous group, bears a seemingly bony wrist structure that may have
had a role in flight. SEE LETTER P.70


KEVIN PADIAN 


When the first dinosaurs with feathers
or feather-like structures were
brought to light by Chinese
scientists in the mid-1990s, they cemented the
hypothesis of the dinosaurian origin of birds
and provided spectacular evidence about the
origin of flight and the primordial functions of
feathers. In the ensuing two decades, the picture
of the evolution of feathers and flight has
become richer and more complicated as other
feathered dinosaurs have been discovered,
seemingly on a monthly basis. But things have
just gone from the strange to the bizarre. On
page 70 of this issue, Xu et al. present a feathered
dinosaur from a completely unexpected
branch of the dinosaur tree — and it sports a 
never-before-seen skeletal element that the 
authors think may be related to flight. 

The dinosaur, named Yi qi, is a member of
the unusual group of theropod (carnivorous)
dinosaurs called Scansoriopterygidae (Fig. 1).
Although scansoriopterygids are not well 
known — there are only three kinds, and only 
incomplete remains of these — they seem to 
be small (the skull of Yi qi is about 4 centime- 
tres long), with smaller and fewer teeth than 
other theropods, and with long hands. Unusu- 
ally for theropod dinosaurs, the third finger is 
longer than the second. It is especially long in 
Yi qi, and its forelimb is further distinguished 
by a long ‘styliform element’ coming off the
wrist. The authors are refreshingly agnostic 
about the exact function of this new struc- 
ture, partly because it is so different from any- 
thing previously known. However, their find 
opens two cans of worms: about interpreting 
unique structures in fossils and about what it 
means to fly. 

The styliform element, which may be a 
hypertrophied wrist bone or a neomorphic 
calcified structure, is longer than any of the 
animal’s fingers and is curved at both ends. It
is probably not a true finger, such as the fourth 
finger that forms the main outboard spar of 
the wing in pterosaurs — flying reptiles that 
were related to their dinosaur contemporaries, 
but that evolved flight independently of birds. 
Instead, the styliform element of Yi qi has no 
joints and comes directly off the carpal bones 
without the intermediary of a metacarpal 
(palm bone), so it is probably not a finger (the 
authors confusingly label the three fingers of 
the dinosaur-bird hand as II-III-IV instead
of the standard I-II-III, which corresponds
to our thumb, index and middle fingers).
How the structure is attached to the wrist is 
not clear, because its proximal end seems quite 
squared off; this means that we also do not 
know if or how it could move. Superficially, it 
looks the same as the animal’s other bones, a 
conclusion supported by the authors’ energy 
dispersive spectrometry analysis. It would be 
interesting to see if it has true bone cells, to 
confirm that it is indeed bone and not calcified 
cartilage or other calcified tissue. 

What could this element be except a 
support for some kind of aerofoil? The authors 
infer this on the basis of its position and the 
presence of membranous tissue in the wrist 
area. But although they consider a variety of 
analogous structures in living and extinct fly- 
ing and gliding animals, none is exactly like 
the styliform element in Yi qi. Furthermore, 
Yi qi’s body is not preserved below the ribcage,
so reconstructions of the pelvis, hindlimbs and 
tail must be conjectured from what is known of 
other scansoriopterygids (Fig. 1). Further ana- 
tomical analysis of this structure and how the 
rest of the body related to it — such as whether 
the tail created lift or drag — will require other 
discoveries. 

In the meantime, the authors do not 
commit themselves to whether this animal 
could flap or glide, or both, or neither. That 
is a good position to take, but we can parse it
further. To fly actively, an animal must be able
to execute a flight stroke that can generate a
vortex wake that propels it forward. No evidence
presented so far suggests that Yi qi had
this ability. Furthermore, in flapping animals
the outboard skeletal elements (wrist, hand
and so on) are primarily responsible for thrust,
the essential component of powered flight,
but these are not particularly long in Yi qi. So, 
at present we can shelve the possibility that this 
dinosaur flapped. 

As for gliding, if Yi qi’s styliform element 
helped to support a membranous aerofoil, it 
can be used to reconstruct the planform of the 
wing, as Xu and colleagues have done. But in a
gliding animal, the centre of lift of the aerofoil 
should be fairly congruent with the centre of 
gravity of the body — if the bulk of the animal's 
weight falls too far behind the centre of lift, 
the back end will sag and the animal will stall.
That is clearly the case in the authors’ recon- 
struction of Yi qi, but an aerofoil that was swept 
back more, if anatomically possible, might
have mitigated this problem. Still, we are left
in a quandary: an animal with a strange structure
that looks as if it could have been used
in flight, borne by an animal that otherwise
shows no such tendencies. And so far, there is
no other plausible explanation for the function
of this structure.

Despite this aeronautic uncertainty, the
paper is a milestone for another reason. The
Yi qi fossil was found by a farmer, which is the
case for many Chinese fossils. But Xu and colleagues
provide more complete information
about the geographical and geological provenance
of their specimen than has accompanied
other recent Chinese fossils collected by non-scientists.
The Supplementary Information to
the paper documents how the authors verified
the provenance of the specimen and even
excavated fossils in associated local sediments.
Moreover, they examined the specimen meticulously
to be sure that none of its elements had
been faked or restored. This is a key advance
and sets the standard for future publications
of specimens procured from third parties. The
authors are due thanks for this diligence from
the entire palaeontological community.

Figure 1 | The three known members of the Scansoriopterygidae. Fossils
have been found from three members of this group of theropod dinosaurs:
Epidendrosaurus ningchengensis, Epidexipteryx ningchengensis and the newly
discovered form reported by Xu et al., named Yi qi. They are all unusual for
theropods because the third finger is longer than the second, rather than the
other way round. It is considerably longer in Epidendrosaurus and even more
so in Yi qi. The new dinosaur also sports a unique ‘styliform element’ on the
wrist that seems to be made of bone and to have had a membranous structure
attached. Reconstructions have been inferred from incomplete skeletons.
Panels: a, Epidendrosaurus ningchengensis; b, Epidexipteryx ningchengensis.
Scale bars, 5 cm. (Image: Lida Xing.)


MATERIALS CHEMISTRY


Organic polymers form
fuel from water


Porous polymers have joined the ranks of light-activated catalysts that split
water into hydrogen, a carbon-free alternative to fossil fuels. Their properties
are easily tuned — a big plus for the development of practically useful catalysts.


VIJAY S. VYAS & BETTINA V. LOTSCH


The Sun could be harnessed as an
unlimited source of energy by exploiting
another naturally abundant
resource: water. The light-induced splitting
of water into oxygen and hydrogen generates
storable chemical fuels that have no
carbon footprint, potentially solving the world’s
ever-increasing energy demands. The seemingly
simple task of absorbing sunlight to split
water requires a semiconductor catalyst, and
inorganic catalysts are leading the field. Writing
in the Journal of the American Chemical
Society, Sprick et al. demonstrate that organic
photocatalysts (light-activated catalysts)
may become just as useful as their inorganic
counterparts, and offer intriguing opportunities
for future research because the catalysts’
physical responses to light can be tailored.
Designing photocatalytic materials for water
splitting is far from easy: not only should they
absorb light efficiently to form photoexcited
states, but also the excitations should be
long-lived and effectively lead to separation
of charges at the catalyst’s surface, where the
redox reactions needed for water splitting
occur. Particulate photocatalysts often require
additional ‘sacrificial’ agents with a larger
thermodynamic driving force than water to 
accept a light-generated charge carrier. This 
can dramatically facilitate charge separation, 
the largest bottleneck in the photocatalytic 
process. Moreover, the vast majority of photo- 
catalysts are unlikely to work efficiently alone 
— a co-catalyst such as platinum or another
noble metal is needed to lower the energy 
losses associated with hydrogen or oxygen 
evolution. 

Substantial research efforts are therefore 
focused on finding materials that offer better 
light harvesting, charge transport and conver- 
sion of water to hydrogen and oxygen. Most 
of these materials are inorganic semiconductors,
which are highly robust, but whose
properties are often barely tunable. Although 
metal-free photocatalysts such as carbon 
nitrides have been reported, soft organic
polymers have not yet found their place in the 
race. Sprick and colleagues now highlight the 
potential of porous organic polymers for pro- 
ducing hydrogen. These soft materials offer 
ample opportunities for systematic engineer- 
ing of their bandgap, a property that governs 
what part of the solar spectrum is absorbed, 
and that can be manipulated to improve the
effectiveness of photocatalysis.

Organic materials formed from
layers of two-dimensional atomic
networks, which include crystalline
covalent organic frameworks
and amorphous conjugated microporous
polymers (CMPs), are
chemically inert, thermally stable
and have potentially useful optoelectronic
properties combined
with high surface areas. They have
therefore been widely used for gas
adsorption, chemical sensing and
catalysis. These materials can
be made from a broad range of
organic building blocks and bond-forming
reactions, thus providing
an extensive toolbox for the systematic
fine-tuning of their structural
and physical properties.

Sprick et al. prepared a series of
15 different CMPs from phenylene
and pyrene building blocks, using
palladium-catalysed reactions
(Fig. 1). The polymers exhibit a
continuous variation in optical
properties that depends on the
ratio of phenylene to pyrene units:
increasing the pyrene content
causes a gradual decrease in the
optical bandgap from 2.95 electronvolts
to 1.94 eV, an effect that allows
the polymers to absorb increased 
amounts of the solar spectrum. The 
researchers rationalize this trend by 
proposing that low-energy optical 
excitations in the polymers become 
dominated by contributions from 
pyrene’s molecular orbitals as the 
pyrene content rises. They also sug- 
gest that structural effects — such 
as the formation of cyclic substruc- 
tures (rings) and the strain within 
them — become more dominant 
as the pyrene component increases. 
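
To see what this shift in bandgap means in terms of colour, the two values quoted above can be converted into the longest wavelength each polymer could absorb. The sketch below assumes the simple textbook picture in which a photon is absorbed only if its energy is at least the optical bandgap; the function name is illustrative, and real absorption spectra have broader, less sharply defined edges.

```python
PLANCK_EV_S = 4.135667696e-15   # Planck constant, in electronvolt seconds
LIGHT_M_S = 299_792_458.0       # speed of light, in metres per second

def absorption_edge_nm(bandgap_ev):
    """Longest absorbed wavelength, assuming absorption needs E_photon >= bandgap."""
    return PLANCK_EV_S * LIGHT_M_S / bandgap_ev * 1e9

if __name__ == "__main__":
    for gap in (2.95, 1.94):    # bandgap range quoted in the text, in eV
        print(gap, "eV ->", round(absorption_edge_nm(gap)), "nm")
```

Under that assumption, the absorption edge moves from roughly 420 nanometres (violet) to roughly 640 nanometres (red) across the series, which is why the pyrene-rich polymers can take in more of the visible spectrum.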

The authors tested the porous polymers for 
their ability to catalyse hydrogen evolution 
from water in visible light, using the organic 
compound diethylamine as a sacrificial elec- 
tron donor. Remarkably, all of the polymers 
promoted stable hydrogen evolution for at 
least 6 hours, and the best polymer was shown 
to work for more than 100 hours without 
much decline in activity. This behaviour was 
predicted by the researchers’ theoretical cal- 
culations, which showed that all the CMPs 
have a strong thermodynamic driving force 
to promote hydrogen evolution. Sprick et al. 
observed no signs of light-induced degrada- 
tion of the CMPs in their experiments; such 
stability is a key prerequisite for any catalyst if 
it is to be more than just a laboratory curiosity. 

On the basis of the CMPs’ optical proper- 
ties, the rate of hydrogen evolution would be 
expected to increase across the series of polymers
(that is, as the bandgap decreases). But
the authors observed that hydrogen evolution
reaches a peak for the CMP that has a bandgap
of 2.33 eV (Fig. 1); it then declines for the
remaining polymers that have smaller bandgaps.
The authors suggest that recombination
of separated charge carriers (which prevents
electrons from being transferred from the
polymer) becomes dominant in CMPs that
have smaller bandgaps, or that the kinetic
barrier to electron transfer increases, both of
which would reduce hydrogen evolution.

Figure 1 | Optical and photocatalytic properties of a series of porous
polymers. a, Sprick et al. have prepared 15 conjugated microporous polymers
(CMP1 to CMP15) from a mixture of phenylene-containing (red) and pyrene-containing
(blue) building blocks, increasing the ratio of pyrene to phenylene
across the series. b, The optical bandgap of the polymers decreases as the pyrene
content increases, implying that the polymers’ ability to catalyse hydrogen
production from water when irradiated with visible light should increase across
the series. In fact, the rate of hydrogen evolution peaked for CMP10, possibly
because of mechanisms that reduce the availability of electrons to take part in
the hydrogen-producing reaction in CMP11 to CMP15.

Remarkably, the polymers are active in the
absence of any deliberately added noble metals,
providing a possible solution to the long-standing
and much-researched question of
how to minimize the amount of these expensive
co-catalysts that is required. But as the
authors point out, traces of palladium trapped
in the CMPs during their synthesis might act as
masked co-catalysts. To rule out the possibility
that residual noble metals appreciably affect
the rate of hydrogen evolution,
Sprick and colleagues devised a
palladium-free synthesis for their
photocatalysts, deliberately added
platinum to some of their experiments,
and performed other tests in
which carbon monoxide was added
to ‘poison’ any traces of palladium.
All of these studies suggested that
the rate of hydrogen evolution correlates
more strongly with the optical
bandgap than with noble-metal
content.

Another notable feature of the
CMP photocatalysts is their selective
visible-light activity, with
almost no activity observed in
ultraviolet light. This unusual bias
renders them ‘true’ visible-light
photocatalysts, boding well for the
future design of photocatalysts that
absorb a large fraction of the visible-light
spectrum with maximum
light-harvesting efficiency.

Against the background of inorganic
semiconductor photocatalysis,
Sprick and co-workers’ findings
highlight the power of organic
photocatalysis, with its armoury of
molecular-engineering protocols
suitable for producing complex
catalysts that have improved bandgaps.
But as with all breakthroughs,
there is more to be done: greater 
insight into charge-carrier dynam- 
ics and the nature of the catalyti- 
cally active sites will be required to 
further increase the efficiency of 
polymer photocatalysts and make 
them practically viable. 

Tuning the catalytic activity 
will necessitate subtle control of 
the polymers’ composition, and 
especially of their crystallinity and 
microstructure. This poses grand 
synthetic challenges, and will probably require 
the development of alternative cheap, revers- 
ible polymerization protocols. Finally, a single 
porous polymer that catalyses ‘complete’ water 
splitting — which does not require sacrificial 
electron donors — has yet to be realized. The 
race is on.

Sequential cancer mutations in cultured
human intestinal stem cells 


Jarno Drost, Richard H. van Jaarsveld, Bas Ponsioen, Cheryl Zimberlin, Ruben van Boxtel, Arjan Buijs,
Norman Sachs, René M. Overmeer, G. Johan Offerhaus, Harry Begthel, Jeroen Korving, Marc van de Wetering,
Gerald Schwank, Meike Logtenberg, Edwin Cuppen, Hugo J. Snippert, Jan Paul Medema,
Geert J. P. L. Kops & Hans Clevers


Crypt stem cells represent the cells of origin for intestinal neoplasia. Both mouse and human intestinal stem cells can 
be cultured in medium containing the stem-cell-niche factors WNT, R-spondin, epidermal growth factor (EGF) and 
noggin over long time periods as epithelial organoids that remain genetically and phenotypically stable. Here we utilize 
CRISPR/Cas9 technology for targeted gene modification of four of the most commonly mutated colorectal cancer genes 
(APC, P53 (also known as TP53), KRAS and SMAD4) in cultured human intestinal stem cells. Mutant organoids can be 
selected by removing individual growth factors from the culture medium. Quadruple mutants grow independently of all 
stem-cell-niche factors and tolerate the presence of the P53 stabilizer nutlin-3. Upon xenotransplantation into mice, 
quadruple mutants grow as tumours with features of invasive carcinoma. Finally, combined loss of APC and P53 is 
sufficient for the appearance of extensive aneuploidy, a hallmark of tumour progression. 


The adenoma-carcinoma sequence proposes that the sequential 
acquisition of specific genetic alterations underlies the progression 
of colorectal cancer (CRC)'. Activation of the WNT pathway, most 
commonly through inactivating mutations in APC, initiates the 
formation of benign polyps. Progression is thought to occur through 
activating mutations in the EGF receptor (EGFR) pathway and 
inactivating mutations in the P53 and transforming growth factor 
(TGF)-β pathways’. Recent sequencing efforts have further explored 
the genomic landscape underlying CRC*. A major hurdle in iden- 
tifying essential driver mutations is that many CRCs have acquired 
either microsatellite instability or chromosomal instability (CIN), as 
tumours typically harbour hundreds to thousands of mutations. 
Using mouse models, Lgr5+ intestinal stem cells were identified 
as cells of origin for intestinal neoplasia and were shown to fuel 
effective tumour growth*®. A recent study has shown that dereg- 
ulation (by retroviral expression of short hairpin RNAs (shRNAs) 
or cDNA) of APC, P53, KRAS and SMAD4 is sufficient for trans- 
formation of cultured mouse colon into tumours with adenocarci- 
noma-like histology’. Of note, the reliance on paracrine growth 
factors provided by a mesenchymal component in this system does 
not allow a one-to-one correlation with the individual oncogenic 
mutations. Comparable human in vitro model systems to study 
tumour initiation and progression have not been developed. We 
have previously described ‘indefinite’ three-dimensional stem cell 
culture systems (organoids) derived from several organs including 
mouse and human small intestine, colon, pancreas and liver that 
remain genetically stable*’. 


Sequential introduction of CRC mutations 


We set out to utilize CRISPR/Cas9-mediated genome editing'*"'* to 
introduce four of the most frequent CRC mutations in human small 
intestinal organoid stem cell cultures. As the absolute knockout 


efficiency is low, we made use of functional selection strategies to 
obtain clonal, mutant organoids. Since loss of APC is generally con- 
sidered to be an early event in CRC’, we first introduced inactivating 
mutations in APC. As previously described, withdrawal of WNT and 
R-spondin from the defined culture medium provides a functional 
selection for APC loss’’ (Fig. 1a). Indeed, control-transfected orga- 
noids died when seeded in medium lacking WNT and R-spondin 
(Fig. 1b), whereas transfection of plasmids expressing Cas9 and single 
guide RNAs (sgRNAs) targeting APC in its mutation hotspot region 
allowed cystic clonal organoids to emerge (Fig. 1b, Extended Data 
Fig. 1a and Extended Data Table 1a). To obtain clonal cultures, indi- 
vidual organoids were expanded. Genotyping verified the presence of 
clonal insertions or deletions (indels) at the targeted regions 
(Extended Data Fig. 1b). Quantitative reverse transcription polymer- 
ase chain reaction (qRT-PCR) analysis for the WNT target gene 
AXIN2 confirmed the constitutive activity of the WNT pathway, as 
AXIN2 messenger RNA levels did not decrease upon WNT/R-spon- 
din withdrawal (Fig. 1c). 

Next, we introduced inactivating mutations in P53 in APC knockout 
(APC^KO) intestinal organoids. We made use of nutlin-3 (ref. 18) to 
select for organoids with a functionally inactive P53 pathway (Fig. 1a). 
As expected, nutlin-3 stabilized P53 in intestinal organoids and activated 
transcription of its target gene P21 (also known as CDKN1A) 
(Fig. 1e). Control sgRNA-transfected APC^KO organoids died upon 
nutlin-3 treatment (Fig. 1d), whereas transfection of plasmids expressing 
Cas9 and sgRNAs targeting P53 enabled organoid outgrowth 
(Fig. 1d, Extended Data Fig. 1a and Extended Data Table 1a). 
Clonal expansion and genotyping verified the presence of frameshift-inducing 
indels at the targeted loci (at the start of the DNA-binding 
domain, thereby yielding an inactive gene product; Extended 
Data Fig. 1c). Loss of P53 protein expression and P53 pathway inactivity 
were confirmed by western blot (Fig. 1e). 
Figure 1 | Inactivation of APC and P53 in human intestinal organoids. 
a, Strategy to generate the indicated mutant lines using CRISPR/Cas9. Blue, 
stem cells. E, EGF; N, noggin; R, R-spondin; W, WNT. b, Wild-type organoids 
in complete medium (WENR; top left) and transfected with Cas9 and the 
indicated sgRNAs selected in EN medium (representative pictures from n = 3 
independent experiments). c, qRT-PCR for AXIN2 in wild-type and APC^KO 
organoids in the presence or absence of WNT/R-spondin. Expression 
normalized to GAPDH. Horizontal bars represent mean of n = 3 independent 
experiments. d, APC^KO organoids were transfected with Cas9 and the indicated 
sgRNAs. P53 mutants were selected in medium with nutlin-3 (representative 
pictures from n = 3 independent experiments). e, Western blot analysis of 
P53 and P21 expression in organoids cultured in the presence/absence of 
nutlin-3 (representative from n = 3 independent experiments). Scale bars, 
100 µm. 


The most common KRAS mutation in CRC results in the expression 
of constitutively active KRAS(G12D). To introduce this mutation, 
we designed an oligonucleotide with the oncogenic mutation and two 
silent mutations to serve as a template for homologous recombination 
(Fig. 2a). KRAS^G12D mutants were selected by withdrawing EGF and 
adding the EGFR inhibitor gefitinib to the culture medium. Of note, 
resident Paneth cells produce EGF in organoids’. Whereas control-transfected 
organoids failed to expand in the selection medium, organoids 
transfected with the oligonucleotide, Cas9 and the KRAS 
sgRNA grew out (Fig. 2b and Extended Data Table 1a). Genotyping 
of clonally expanded organoids confirmed that the resistant clones 
harboured the KRAS^G12D mutation (Fig. 2c). The two silent mutations 
were also present in the recombined allele, verifying that the mutations 
were introduced using the provided template. Although the 
second KRAS allele did not recombine, it was often targeted by 
Cas9 endonuclease, resulting in a frameshift in the second allele. 


Quadruple mutants do not need niche factors 

We then set out to introduce combinations of CRC mutations. 
We used our KRAS(G12D)-expressing organoids to introduce inactivating 
mutations in APC, P53 and SMAD4 (Fig. 2d). To select 
for inactivating mutations in SMAD4, an essential downstream 
component of the TGF-β and bone morphogenetic protein (BMP) 
pathways, we made use of the dependence of the intestinal organoids 
on the presence of the BMP pathway inhibitor noggin in the culture 
medium’. Using the described selection procedures, transfection of 
Cas9 with either APC or both APC and P53 sgRNAs yielded 
KRAS^G12D/APC^KO and KRAS^G12D/APC^KO/P53^KO organoids, respectively 
(Fig. 2e, f and Extended Data Fig. 2a, b, e). Transfection of Cas9 
together with sgRNAs targeting APC, P53 and SMAD4 yielded orga- 
noids growing in medium lacking EGF, WNT, R-spondin and noggin, 
to which nutlin-3 was added (Fig. 2e, Extended Data Fig. 2d and 
Extended Data Table 1a). Clonal expansion and sequencing of the 
targeted loci in APC and P53 verified frameshift-inducing indels. 
Sequencing of the targeted exon in SMAD4 in several different clones 
revealed a frameshift-inducing deletion in one allele and an in-frame 
deletion in the other allele (sgRNA 1, P356del; sgRNA 3, V370del) 
(Extended Data Fig. 2c). Western blot confirmed reduced protein 
expression in SMAD4-mutated organoids (Fig. 2f). As with all 
sgRNAs, SMAD4 sgRNAs target the mutation hotspot region, encod- 
ing the MH2 domain required for SMAD4 activity’’”°. Recently, in- 
frame deletions of SMAD4 P356 and V370 were shown to occur in 
CRC”, indicating that in-frame indels at these locations yield an 
inactive gene product. Using a candidate off-target prediction tool, 
we detected no lesions of predicted off-target sites for the sgRNAs 
used to introduce mutation combinations in our human intestinal 
organoids (Supplementary Table 1). Although this analysis was limited, 
in combination with the analysis of multiple independent clonal 
organoids, the results indicated that the observed effects were not due 
to off-target effects. In conclusion, KRAS^G12D/APC^KO/P53^KO/SMAD4^KO 
mutant intestinal organoids can grow in the absence of 
all stem-cell-niche factors in vitro (Extended Data Table 1b). 

Figure 2 | KRAS^G12D/APC^KO/P53^KO/SMAD4^KO organoids grow in the 
absence of stem-cell-niche factors in vitro. a, Strategy to introduce the 
KRAS^G12D mutation. Asterisks indicate silent mutations; arrowheads indicate 
genotype primers. b, Wild-type human intestinal organoids were transfected 
with Cas9, sgRNA and the oligonucleotide. KRAS^G12D mutants were selected in 
medium lacking EGF, with the EGFR inhibitor gefitinib (representative 
pictures from n = 3 independent experiments). c, Sequence analysis of the 
targeted KRAS exon. Oncogenic GGT>GAT mutation is indicated in green; 
silent mutations are in blue; protospacer adjacent motif (PAM) is underlined in 
red. d, Strategy to generate the indicated mutant lines using CRISPR/Cas9. 
Blue, stem cells. N, noggin (Nog); R, R-spondin; W, WNT. e, KRAS(G12D)-expressing 
organoids were transfected with Cas9 and the indicated sgRNAs 
(representative pictures from n = 3 independent experiments). f, Western blot 
analysis of SMAD4, P53 and APC expression in the indicated organoid lines 
(representative from n = 3 independent experiments). K, KRAS^G12D; A, 
APC^KO; P, P53^KO; S, SMAD4^KO. Scale bars, 100 µm. 


Quadruple mutants grow as invasive carcinomas 


Next, we investigated whether our engineered organoids were tumorigenic 
in vivo. We subcutaneously injected wild-type and all engineered 
mutant organoid lines into immunodeficient mice. After 8 
weeks, some mice injected with KRAS^G12D/APC^KO/P53^KO organoids 
(‘triple’; 3 out of 12 injections) and the majority of mice injected with 
KRAS^G12D/APC^KO/P53^KO/SMAD4^KO organoids (‘quadruple’; 13 out 
of 16 injections) developed visible nodules (Extended Data Fig. 4a). 
Histological analysis confirmed that triple organoids did engraft, but 
remained small with few proliferating cells and mostly resembled 
adenomas (Fig. 3a and Extended Data Fig. 4b, c). Quadruple-derived 
tumours were larger, highly proliferative and all displayed features of 
invasive carcinoma, including an irregular multi-layered epithelium 
consisting of tumour cells with increased nuclear-cytoplasmic ratio, 
pleiomorphic and hyperchromatic nuclei. Invasion of isolated or 
small aggregates of tumour cells into the stroma was frequently 
observed (Fig. 3b and Extended Data Fig. 4b, d). The tumour origin 
was verified using a human-specific cytokeratin antibody (hKRT; 
Fig. 3 and Extended Data Fig. 4c, d). Thus, introduction of oncogenic 
mutations in KRAS, APC, P53 and SMAD4 enables normal human 
intestinal stem cell organoids to grow as tumours with invasive car- 
cinoma features in vivo (Extended Data Table 1b). In vitro, both 
triple- and quadruple-mutant organoids exhibited a high prolifera- 
tion rate, while only quadruple-mutant organoids frequently 
appeared as solid tumour masses (Extended Data Fig. 5a, c). 
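
As a small arithmetic aid, the nodule counts reported for the small intestinal xenografts can be expressed as percentages of injections. The sketch below uses only the counts stated in the text (3 of 12 for triple mutants, 13 of 16 for quadruple mutants); the function and variable names are illustrative and are not part of the study.

```python
# Nodule counts as reported in the text: (visible nodules, injections).
NODULE_COUNTS = {
    "triple": (3, 12),
    "quadruple": (13, 16),
}


def nodule_rate(nodules: int, injections: int) -> float:
    """Return the percentage of injections that produced a visible nodule."""
    if injections <= 0:
        raise ValueError("injections must be positive")
    if not 0 <= nodules <= injections:
        raise ValueError("nodules must lie between 0 and injections")
    return 100.0 * nodules / injections


if __name__ == "__main__":
    for line, (nodules, injections) in NODULE_COUNTS.items():
        rate = nodule_rate(nodules, injections)
        print(f"{line}: {nodules}/{injections} injections ({rate:.2f}%)")
```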


Figure 3 | Quadruple-mutant organoids grow as invasive carcinomas in 
vivo. a, Haematoxylin and eosin (H&E; top left, bottom left), hKRT (top right, 
bottom middle) and Ki67 (bottom right) immunostainings on nodules isolated 
from triple-mutant-injected mice. Representative pictures of an adenoma with 
regular glandular structures lined with a blander epithelium that only focally 
shows a tendency towards stratification (arrowhead), no invasive growth and 
low proliferative capacity (Ki67). n = 3 mice. b, As in a, but for quadruple-mutant-injected 
mice. Upper and lower boxed regions in top panels correspond 
to regions imaged in left bottom panels and middle and right bottom panels, 
respectively. Representative pictures of an invasive carcinoma with irregular 
glandular architecture, gland in gland formation (white arrowhead) and 
luminal debris (asterisk). Mitotic figures are encountered (arrows) and there is 
high proliferative activity (Ki67). Invasion of isolated or small aggregates of 
cells into the stroma is observed (black arrowheads). n = 13 mice. Scale bars, 
100 µm. 


Extensive aneuploidy upon APC and P53 loss 

To determine whether our engineered mutant organoids acquired 
CIN, a hallmark of CRC”, we transduced all human intestinal organoid 
lines with a fluorescently tagged histone 2B (H2B)-encoding 
lentivirus. This enabled us to monitor chromosome segregations 
using three-dimensional live-cell imaging. Wild-type organoids 
underwent mitosis without showing any major abnormalities. We 
did not observe an increase in the percentage of errors in APC^KO 
organoids (Fig. 4a and Supplementary Videos 1 and 2). However, 
APC^KO/P53^KO organoids showed a progressive increase in the percentage 
of errors. We mainly observed anaphase bridges, but a few 
misaligned and lagging chromosomes were also detected (Fig. 4a, b and 
Supplementary Video 3). Importantly, compared to APC^KO/P53^KO 
organoids, triple and quadruple mutants showed only a minor increase 
in the percentage of mitotic errors (Fig. 4a and Supplementary 
Videos 3-5), implying that loss of APC and P53 is sufficient to 
acquire CIN. 
Figure 4 | Progressive CIN and aneuploidy upon introduction of CRC 
mutations. a, Live-cell imaging was performed to monitor chromosome 
segregations. Graph shows the percentage of erroneous mitoses. Each dot 
represents the percentage of errors in one organoid. Horizontal bars represent 
median of all dots. Videos are included of organoids depicted as dots with green 
outline (Supplementary Videos 1-5). WT, wild-type; KO, knockout. b, Stills of 
a typical erroneous mitotic event (anaphase bridge) in an APC^KO/P53^KO 
organoid. Time points are indicated in minutes relative to prophase onset. Scale 
bars, 5 µm. c, Chromosomes were counted in the indicated organoids. Graphs 
plot the percentage of cells with chromosome counts <44, 44-48 (normal) and 
>48 (at least 50 spreads were counted). d, Representative karyotypes of a wild-type 
culture with n = 46 chromosomal counts (left) and APC^KO organoid 
culture (right) with aberrant chromosome numbers. Scale bars, 25 µm. 


To verify that the observed CIN results in aneuploidy, we next 
counted chromosome numbers. Unlike wild-type organoids, karyotyping 
reproducibly revealed numerical aberrations in a low percentage 
of APC^KO organoids (two independent sgRNAs) (Fig. 4c, d). This 
ranged from a trisomy of chromosome 7 to near-tetraploid meta- 
phases (Extended Data Fig. 6a), the latter confirming previous studies 
in mouse embryonic stem cells”. Strikingly, one of the most recurrent 
chromosomal aberrations in low-grade colorectal adenomas in 
patients involves copy number gains of chromosome 7 (refs 24-26). 
In accordance with the chromosome segregation analyses, APC^KO/P53^KO 
organoids showed a marked increase in the percentage of 
aneuploid spreads (Fig. 4c and Extended Data Fig. 6b). The triple 
and quadruple mutants also showed extensive aneuploidy (Fig. 4c 
and Extended Data Fig. 6c, d). 
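
The chromosome-count bins used in the Figure 4c legend (<44, 44-48 as normal, >48) can be sketched as a small tally. This is a minimal illustration, not the authors' analysis code; the function names are hypothetical and any counts passed in are the caller's own data.

```python
from collections import Counter

# Bin labels follow the Figure 4c legend; 44-48 is treated as normal.
BINS = ("<44", "44-48", ">48")


def classify_spread(chromosome_count: int) -> str:
    """Assign one metaphase spread to a chromosome-count bin."""
    if chromosome_count < 44:
        return "<44"
    if chromosome_count <= 48:
        return "44-48"
    return ">48"


def ploidy_percentages(counts: list[int]) -> dict[str, float]:
    """Return the percentage of spreads that fall in each bin."""
    if not counts:
        raise ValueError("at least one spread is required")
    tally = Counter(classify_spread(count) for count in counts)
    return {label: 100.0 * tally.get(label, 0) / len(counts) for label in BINS}


if __name__ == "__main__":
    # Hypothetical counts for illustration only (the study scored >= 50 spreads).
    example = [46, 46, 47, 92]
    print(ploidy_percentages(example))
```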

To confirm these data, APC^KO, P53^KO and APC^KO/P53^KO organoids 
were engineered in a second human small intestinal line 
(Extended Data Fig. 7a-c). Again, loss of both APC and P53 had 
the most dramatic effect on CIN and aneuploidy. Although the single 
loss of P53 resulted in a substantial increase in the percentage of 
segregation errors, only a minor increase in the amount of aberrant 
spreads was observed (Extended Data Fig. 7d, f and Supplementary 
Video 6). Thus, we show that the combined loss of APC and P53 is 
sufficient for the appearance of extensive aneuploidy. Despite the 
observed chromosome missegregations, our engineered lines con- 
tinue proliferating, while maintaining functional DNA damage sig- 
nalling (Extended Data Fig. 7h). 


CRC mutations in human colon organoids 


Finally, we introduced all the mutation combinations described earl- 
ier into a human colon organoid stem cell culture”’, following the 
same functional selection procedures (Extended Data Fig. 3 and 
Extended Data Table 1b). Importantly, this yielded essentially ident- 
ical results to those obtained with the small intestinal stem cells, in 
terms of growth factor independence, in vitro appearance, CIN and 
aneuploidy (Extended Data Figs 5b and 7e, g). Moreover, both triple- 
and quadruple-mutant human colon organoids grew with high effi- 
ciency as tumours upon xenotransplantation into immunodeficient 
mice (Extended Data Fig. 8a, b). Histological analysis revealed that 
triple-mutant tumours contained large cysts and locally displayed 
features of well-differentiated carcinomas with relatively limited 
invasive growth, whereas the quadruple-mutant-derived invasive car- 
cinomas were faster growing, had a poorly differentiated appearance 
and displayed very frequent tumour budding at the invasive front, as 
well as invasion of the underlying muscle tissue (Extended Data Fig. 
8c-e). 

While this manuscript was under final review, a study using a 
similar strategy appeared”*. Our CRC progression model selects out 
functional mutants by changing the culture medium composition and 
all sgRNAs were designed to target mutation hotspot regions. 
Therefore, we believe that our model reflects the in vivo situation 
more closely than any other in vitro human CRC model so far. 
Upon oncogenic mutation of KRAS, APC, P53 and SMAD4, human 
gut stem cell organoids can grow in the absence of all stem-cell-niche 
factors and in the presence of the P53 stabilizer nutlin-3 in vitro and as 
tumours with invasive carcinoma features in vivo. Moreover, we find 
that our engineered CRC organoid lines show marked CIN and aneu- 
ploidy, both considered to be hallmarks of cancer’. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


Human material for organoid cultures. Approval for this study was obtained from 
the ethics committees of the University Medical Centre Utrecht (duodenal biops- 
ies) and The Diakonessen Hospital Utrecht (colonic tissues). Written informed 
consent was obtained. 

Organoid culture. Endoscopic duodenal biopsy samples were obtained from two 
female individuals (patient 1, age 2 years; patient 2, age 8 years). These individuals 
were admitted for suspected coeliac disease or dyspepsia. Upon immunological 
and pathophysiological analysis, none of the individuals was diagnosed with 
coeliac disease, whereas patient 1 presented with signs of gastric metaplasia. All 
duodenal biopsies that were used in this study were found to be healthy on the 
basis of histological examination. Normal human colon tissue was isolated from a 
resected colon segment derived from a patient (female, age 60 years) diagnosed 
with CRC (sigmoid). Culture establishment was described previously’””’. Culture 
medium contains advanced DMEM/F12 medium (Invitrogen) including B27 
(Invitrogen), nicotinamide (Sigma-Aldrich), N-acetylcysteine (Sigma-Aldrich), 
noggin (Peprotech), R-spondin 1 (ref. 29), EGF (Peprotech), WNT conditioned 
media (50%, produced using stably transfected L cells), TGF-β type I receptor 
inhibitor A83-01 (Tocris) and P38 inhibitor SB202190 (Sigma-Aldrich). For 
selection of KRAS^G12D mutants, organoids were grown in culture medium lacking 
EGF and containing 0.5-1.0 µM gefitinib (Selleck Chemicals). For mutant P53 
selection, organoids were cultured in the presence of 5-10 µM nutlin-3 (Cayman 
Chemical). Organoids were repeatedly tested for mycoplasma contamination, with 
negative results. 
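
The functional selection logic described in this paper (withdraw WNT and R-spondin for APC loss, add nutlin-3 for P53 loss, withdraw EGF and add gefitinib for KRAS^G12D, withdraw noggin for SMAD4 loss) can be summarized as a lookup table. The sketch below is illustrative only: the names are hypothetical, and combining rules by simple union is an assumption. In particular, the text describes the quadruple-mutant medium as lacking all four niche factors with nutlin-3 added and does not state whether gefitinib was retained at that stage.

```python
# Niche factors of the complete (WENR) medium, as named in the text.
NICHE_FACTORS = {"WNT", "R-spondin", "EGF", "noggin"}

# Selection rule per engineered mutation, taken from the Results text.
SELECTION_RULES = {
    "APC_KO": {"withdraw": {"WNT", "R-spondin"}, "add": set()},
    "P53_KO": {"withdraw": set(), "add": {"nutlin-3"}},
    "KRAS_G12D": {"withdraw": {"EGF"}, "add": {"gefitinib"}},
    "SMAD4_KO": {"withdraw": {"noggin"}, "add": set()},
}


def selection_medium(mutations: list[str]) -> dict[str, list[str]]:
    """Return remaining niche factors and added compounds for a genotype.

    Assumption: rules for individual mutations are combined by union.
    """
    factors = set(NICHE_FACTORS)
    additives: set[str] = set()
    for mutation in mutations:
        rule = SELECTION_RULES[mutation]  # KeyError for unknown mutations
        factors -= rule["withdraw"]
        additives |= rule["add"]
    return {"niche_factors": sorted(factors), "additives": sorted(additives)}


if __name__ == "__main__":
    print(selection_medium(["APC_KO"]))
    print(selection_medium(["KRAS_G12D", "APC_KO", "P53_KO", "SMAD4_KO"]))
```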

Organoid transfection and genotyping. The organoid lipofection protocol was 
previously described in detail'’. In short, human organoids were grown in the 
media described earlier, and trypsinized for 10 min at 37 °C. After trypsinization, 
cells were resuspended in 450 µl growth medium (containing the Rho kinase 
inhibitor Y-27632) and plated in 48-well plates at high density (80-90% confluent). 
Nucleic acid-Lipofectamine 2000 complexes were prepared according to the 
standard Lipofectamine 2000 protocol (Invitrogen). Four microlitres of 
Lipofectamine 2000 reagent in 50 µl Opti-MEM medium (Gibco), and a total 
of 1.5 µg of DNA (sgRNA, Cas9, with/without oligonucleotide in 50 µl 
Opti-MEM medium) were mixed together, incubated for 5 min, and added to the cells 
(50 µl per well). The plate was centrifuged at 600g at 32 °C for 1 h, and incubated 
for 4 h at 37 °C before single cells were plated in Basement Membrane Extract 
(BME; Amsbio) or Matrigel (BD Biosciences). Growth medium plus Y-27632 was 
exchanged with selection medium 3 days after transfection. For clonal expansion 
single organoids were picked. On average, the efficiency of introduction of frame- 
shift-inducing mutations was approximately 1%. sgRNA transfections and sub- 
sequent selections were performed at least three times in both human small 
intestine and colon lines. 

For genotyping, genomic DNA was isolated using Viagen Direct PCR 
(Viagen). Primers for the PCR amplification using GoTaq Flexi DNA polymerase 
(Promega) were as follows: APC_for, 5'-TGTAATCAGACGACACAGGAAGCAGA-3', 
APC_rev, 5'-TGGACCCTCTGAACTGCAGCAT-3'; P53_for, 
5'-CAGGAAGCCAAAGGGTGAAGA-3', P53_rev, 5'-CCCATCTACAGTCCCCCTTG-3'; 
KRAS_for, 5'-TGGACCCTGACATACTCCCA-3', KRAS_rev, 
5'-AAGCGTCGATGGAGGAGTTT-3'; SMAD4_for, 5'-TGGAGTGCAAGTGAAAGCCT-3', 
SMAD4_rev, 5'-ACCGACAATTAAGATGGAGTGCT-3'. Products 
were cloned into pGEM-T Easy vector system I (Promega) and subsequently 
sequenced using T7 sequencing primer. 
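
The distinction drawn in the Results between frameshift-inducing and in-frame indels comes down to whether the length change is a multiple of three. A minimal sketch, assuming each sequenced allele is compared with its reference as plain strings (the function name and the example sequences are hypothetical):

```python
def classify_indel(ref_allele: str, alt_allele: str) -> str:
    """Classify a length change as in-frame or frameshift.

    Simplification: only the net length difference is considered. An
    in-frame change can still disrupt function, as the text notes for the
    single-residue SMAD4 deletions (P356del, V370del).
    """
    delta = len(alt_allele) - len(ref_allele)
    if delta == 0:
        return "no length change"
    kind = "insertion" if delta > 0 else "deletion"
    frame = "in-frame" if delta % 3 == 0 else "frameshift"
    return f"{frame} {kind}"


if __name__ == "__main__":
    # Hypothetical alleles for illustration only.
    print(classify_indel("ACGTAC", "ACG"))   # 3-nt deletion
    print(classify_indel("ACGT", "ACT"))     # 1-nt deletion
    print(classify_indel("ACGT", "ACGTA"))   # 1-nt insertion
```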

Vector construction. The human codon-optimized Cas9 expression plasmid was 
obtained from Addgene (41815). The sgRNA-GFP plasmid was obtained from 
Addgene (41819) and used as a template for generating target-specific sgRNAs. 
The GFP targeting sequence was exchanged by inverse PCR followed by DpnI 
digestion and T4 ligation as described previously'”. APC, P53 and SMAD4 sgRNA 
sequences are included in Extended Data Figs 1a and 2d. KRAS target sequences: 
number 1, 5’-GAATATAAACTTGTGGTAGTTGG-3’; number 2, 5’- 
GTAGTTGGAGCTGGTGGCGTAGG-3’. 
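
As a quick consistency check on the two KRAS target sequences, the sketch below splits each 23-nucleotide target into a 20-nucleotide protospacer and a 3-nucleotide PAM and tests for the NGG motif. Reading the listed sequences as protospacer plus PAM is an assumption made here for illustration; it is not stated explicitly in the Methods. The codon lookup covers only the GGT>GAT change named in the Figure 2c legend, and all function names are hypothetical.

```python
# KRAS target sequences exactly as listed in the Methods.
KRAS_TARGETS = {
    1: "GAATATAAACTTGTGGTAGTTGG",
    2: "GTAGTTGGAGCTGGTGGCGTAGG",
}

# Only the two codons relevant to the G12D change (standard genetic code).
CODON_TO_AA = {"GGT": "G", "GAT": "D"}


def split_target(target: str) -> tuple[str, str]:
    """Split a 23-nt target into (20-nt protospacer, 3-nt PAM).

    Assumption: the last three nucleotides are the PAM.
    """
    if len(target) != 23:
        raise ValueError("expected a 23-nt target sequence")
    return target[:20], target[20:]


def has_ngg_pam(target: str) -> bool:
    """Return True if the assumed PAM matches the NGG motif."""
    _, pam = split_target(target)
    return pam[1:] == "GG"


def describe_substitution(ref_codon: str, alt_codon: str, position: int) -> str:
    """Name an amino-acid substitution, e.g. 'G12D'."""
    return f"{CODON_TO_AA[ref_codon]}{position}{CODON_TO_AA[alt_codon]}"


if __name__ == "__main__":
    for number, target in KRAS_TARGETS.items():
        protospacer, pam = split_target(target)
        print(number, protospacer, pam, has_ngg_pam(target))
    print(describe_substitution("GGT", "GAT", 12))
```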

RNA isolation, cDNA preparation and qRT-PCR. Organoids were harvested in 
RLT lysis buffer and RNA was isolated using the Qiagen RNeasy kit (Qiagen) 
according to the manufacturer’s instructions. Extracted RNA was used as a tem- 
plate for cDNA production using GoScript reverse transcriptase (Promega) 
according to the manufacturer’s protocol. qRT-PCR was performed using IQ 
SYBR green mix (Bio-Rad) according to the manufacturer’s protocol. Results 
were calculated by using the ΔΔCt method. Organoid treatments: WNT/R-spondin 
withdrawal, 48 h; nutlin-3 10 µM, 24 h. Primer sequences: AXIN2_for, 
5'-AGCTTACATGAGTAATGGGG-3', AXIN2_rev, 5'-AATTCCATCTACACTGCTGTC-3'; 
P21_for, 5'-TACCCTTGTGCCTCGCTCAG-3', P21_rev, 
5'-GAGAAGATCAGCCGGCGTTT-3'; GAPDH_for, 5'-TGCACCACCAACTGCTTAGC-3', 
GAPDH_rev, 5'-GGCATGGACTGTGGTCATGAG-3'. 
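
For readers unfamiliar with the ΔΔCt calculation, the sketch below shows the standard form of the method: the target gene (for example AXIN2 or P21) is normalized to the reference gene (GAPDH) within each sample, and the treated sample is then compared with the control. It assumes roughly 100% amplification efficiency, which is the usual simplification; the function name and the Ct values in the example are hypothetical and do not come from the paper.

```python
def fold_change_ddct(
    ct_target_sample: float,
    ct_ref_sample: float,
    ct_target_control: float,
    ct_ref_control: float,
) -> float:
    """Relative expression by the ΔΔCt method (2 ** -ΔΔCt).

    ΔCt  = Ct(target) - Ct(reference), computed per sample.
    ΔΔCt = ΔCt(sample) - ΔCt(control).
    """
    delta_sample = ct_target_sample - ct_ref_sample
    delta_control = ct_target_control - ct_ref_control
    ddct = delta_sample - delta_control
    return 2.0 ** (-ddct)


if __name__ == "__main__":
    # Hypothetical Ct values: the target amplifies two cycles later in the
    # treated sample, with an unchanged reference gene.
    print(fold_change_ddct(24.0, 18.0, 22.0, 18.0))
```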


Western blot. Samples were lysed using RIPA buffer (50 mM Tris-HCl pH 8.0, 
150 mM NaCl, 0.1% SDS, 0.5% Na-Deoxycholate, 1% NP-40) containing 
Complete protease inhibitors (Roche). Protein content was quantified using 
standard Bradford assay (BioRad) and equal amounts of protein were run on 
SDS-PAGE gels and transferred to PVDF membranes (Millipore). For APC 
western blotting, protein lysates were loaded on gradient polyacrylamide gels 
(4-15%; BioRad) and subsequently transferred. Membranes were blocked and 
probed with antibodies directed against P53 (DO-1, Santa Cruz Biotechnology), 
P21 (F-5, Santa Cruz Biotechnology), APC (FE9, Calbiochem), SMAD4 (B8, 
Santa Cruz Biotechnology), phospho-Chk1 Ser 345 (Bioke) and GAPDH (ab9485, 
Abcam). Organoid treatments: nutlin-3 10 µM, 24 h; doxorubicin 10 µM, 
overnight. Uncropped versions of the most relevant images are provided in 
Supplementary Fig. 1. 

In vivo transplantation assays. Approval for this study was obtained from the 
Animal Experimentation Committee at the Academic Medical Centre in 
Amsterdam (DEC102581). Human organoid lines were expanded in their corresponding 
selection media and trypsinized for 10 min at 37 °C. After trypsinization, 
200,000 cells were resuspended in 50 µl of medium containing 2× required 
growth factors, mixed with Matrigel (BD Biosciences) at a 1:1 ratio and injected 
subcutaneously into NOD scid gamma (NSG; NOD.Cg-Prkdc^scid Il2rg^tm1Wjl/SzJ) 
mice (≥6 injections per organoid line). After 7 (colon) or 8 (small intestine) 
weeks, mice were killed and nodules were processed for analysis. Both males 
and females (aged 8-10 weeks at the start of the experiment; weights, ~30 g 
for males and ~25 g for females) were used. This was randomly distributed 
and does not affect outgrowth. All animals were included in the analysis. Ear 
clipping was used for animal recognition. Number of injections was chosen 
following previous experience in the assessment of experimental variability. 
Animals were caged together and treated in the same way. 
Immunohistochemistry. Tissues were fixed in 4% paraformaldehyde, dehy- 
drated and embedded in paraffin. Sections were subjected to H&E as well as 
immunohistochemical staining. The following primary antibodies were used 
for immunohistochemical staining: anti-cytokeratin clone Cam5.2 (BD 
Biosciences), anti-Ki67 clone MM1 (Sanbio) and E-cadherin clone 36 (BD 
Biosciences). 

Live-cell imaging and karyotyping. To visualize mitoses, organoids were 
infected with lentivirus encoding mNeon-tagged histone 2B and a puromycin- 
resistance cassette (pLV-H2B-mNeon-ires-Puro)”*’. After two passages, these 
were plated in BME in glass-bottom 96-well plates and mounted on an inverted 
confocal laser scanning microscope (Leica SP8X), which was continuously 
held at 37 °C and equipped with a culture chamber for overflow of 6.0% CO2. 
Over 16-20 h, ~10 H2B-mNeon-expressing organoids were imaged simultaneously 
in XYZT-mode using a ×40 objective (N.A. 1.1), using minimal amounts 
of 506 nm laser excitation light from a tuneable white light laser. Time interval 
was approximately 3 min (2:30-3:20 min). For post-acquisition analyses of 
mitotic behaviour, data sets were converted into manageable and maximally 
informative videos, combining z-projection, depth colour-coding and merging 
with transmitted light images (Supplementary Videos 1-6). Mitoses were scored, 
judged and counted manually. 
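
The summary statistic plotted in Fig. 4a (percentage of erroneous mitoses per organoid, with the median across organoids) can be sketched as follows. This is an illustrative reconstruction from the figure legend, not the authors' scoring pipeline, which was manual; the function names and example counts are hypothetical.

```python
from statistics import median


def percent_erroneous(errors: int, mitoses: int) -> float:
    """Percentage of scored mitoses in one organoid that were erroneous."""
    if mitoses <= 0:
        raise ValueError("mitoses must be positive")
    if not 0 <= errors <= mitoses:
        raise ValueError("errors must lie between 0 and mitoses")
    return 100.0 * errors / mitoses


def summarize_line(organoids: list[tuple[int, int]]) -> tuple[list[float], float]:
    """Return per-organoid percentages and their median.

    Each tuple is (erroneous mitoses, total mitoses) for one organoid.
    """
    per_organoid = [percent_erroneous(errors, total) for errors, total in organoids]
    return per_organoid, median(per_organoid)


if __name__ == "__main__":
    # Hypothetical counts for three organoids of one line.
    print(summarize_line([(0, 10), (1, 10), (5, 10)]))
```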

For karyotyping, organoids were treated with 0.1 µg ml^-1 colcemid (Gibco) for 
16 h. Cultures were washed and dissociated into single cells using TrypLE (Gibco) 
and processed as described’. Slides were mounted with DAPI-containing vecta- 
shield and analysed on a DM6000 Leica microscope (at least 50 spreads were 
analysed, n = 3). 

Off-target effect analysis. To assess off-target mutational effects, we compu- 
tationally identified candidate off-target sites for each sgRNA using COD 
software (http://cas9.wicp.net/). The software calculates an off-target score 
depending on sequence similarity: if the sequence perfectly matches the tested 
sgRNA (the target site) the score is 1 and decreases with increasing sequence 
differences. For the sgRNA targeting P53 and SMAD4 we identified 2 and 11 
candidate off-target sites (Supplementary Table 1). For the sgRNA targeting 
APC and KRAS we only considered sites with an off-target score of at least 
0.15, resulting in 74 and 15 candidate off-target sites, respectively. 
We evaluated off-target mutational effects by amplicon-based NGS sequen- 
cing 93 candidate off-target sites and included the target sites for P53, KRAS 
and SMAD4 as positive controls (Supplementary Table 1). To this end, pri- 
mers were designed ~350 nucleotides 5’ and ~150 nucleotides 3’ from the 
candidate site to obtain amplicons of ~500 bp (primer sequences available 
upon request). These regions were PCR amplified for each of the cultures 
using 5 ng genomic DNA, 1× GoTaq PCR Buffer (Promega), 1.5 mM MgCl2, 
0.2 mM of each dNTP, 0.2 µM of each primer of a primer pair and 0.25 
units of GoTaq polymerase (Promega) in a final volume of 10 µl at 94 °C for 
60 s; 15 cycles at 92 °C for 30 s, 65 °C for 30 s with a decrement of 0.2 °C 
per cycle and 72 °C for 60 s; followed by 30 cycles at 92 °C for 30 s, 58 °C for 


©2015 Macmillan Publishers Limited. All rights reserved 


30 s and 72 °C for 60 s; and a final extension at 72 °C for 180 s. Per culture 
the PCR products were pooled and barcoded. Illumina sequence libraries 
were generated using the TruSeq DNA Sample Preparation Kit (Illumina) 
according to the manufacturer’s protocol. Subsequently, the libraries were 
pooled and sequenced using the MiSeq sequencer (2 × 250 bp) to a depth 
of >10,000× base coverage. Sequence reads were mapped to the human 
reference genome (GRCh37/hg19), using the Burrows-Wheeler Aligner 
(BWA) Maximal Exact Matches (MEM) v.0.7.5a mapping tool’! with settings 
‘-c 100 -M’. Small indel calling was performed using the Genome Analysis 
Toolkit (GATK)” haplotype caller v.3.2-2 with ‘best practices’ settings. We 
only considered indels with a variant allele frequency (VAF) of at least 0.15 
(Supplementary Table 1). 
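
Two numerical details of this protocol lend themselves to a short worked sketch: the touchdown phase of the PCR (15 cycles starting at 65 °C with a 0.2 °C decrement per cycle) and the VAF cut-off of 0.15. The code below is an illustration under stated assumptions. It assumes the first touchdown cycle anneals at 65 °C and the decrement applies from the second cycle onward, and it defines VAF as variant-supporting reads divided by total reads at the site; the exact computation used by the variant caller is not described in the text. All function names are hypothetical.

```python
def touchdown_annealing_temps(
    start_c: float = 65.0, decrement_c: float = 0.2, cycles: int = 15
) -> list[float]:
    """Annealing temperature for each touchdown cycle, in degrees Celsius.

    Assumption: cycle 1 runs at start_c and each later cycle is
    decrement_c lower than the one before.
    """
    return [round(start_c - decrement_c * cycle, 1) for cycle in range(cycles)]


def variant_allele_frequency(alt_reads: int, total_reads: int) -> float:
    """Fraction of reads at a site that support the variant allele."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    if not 0 <= alt_reads <= total_reads:
        raise ValueError("alt_reads must lie between 0 and total_reads")
    return alt_reads / total_reads


def passes_vaf_filter(alt_reads: int, total_reads: int, threshold: float = 0.15) -> bool:
    """Return True if the indel meets the VAF threshold used in the text."""
    return variant_allele_frequency(alt_reads, total_reads) >= threshold


if __name__ == "__main__":
    temps = touchdown_annealing_temps()
    print(temps[0], temps[-1], len(temps))
    # Hypothetical read counts at roughly 10,000x coverage.
    print(passes_vaf_filter(2000, 10000), passes_vaf_filter(1000, 10000))
```

Under these assumptions the last touchdown cycle anneals at 62.2 °C, after which the protocol continues with 30 cycles at a fixed 58 °C.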
Extended Data Figure 1 | Introducing inactivating mutations in the APC 
and P53 genes in human intestinal organoids using CRISPR/Cas9. 
a, Schematic representation of the targeted exon of the human APC (left) and 
P53 (right) loci and sequences of the designed sgRNAs. b, c, PCR amplification 
products of the mutated alleles of APC (b) and P53 (c) were obtained using 
primers flanking the targeted exon. Subsequent sequencing revealed indels at 
the expected locations. PAM sequences are underlined in red in wild-type 
sequences. Of note, the curved lines bridging the gaps in deleted alleles are 
drawn by the alignment software. 
Extended Data Figure 2 | KRAS^G12D, APC^KO, P53^KO and SMAD4^KO 
mutation combinations in human intestinal organoids. a-c, PCR 
amplification products of the indicated genes of KRAS^G12D/APC^KO 
(a), KRAS^G12D/APC^KO/P53^KO (b) and KRAS^G12D/APC^KO/P53^KO/SMAD4^KO 
(c) organoids were obtained using primers flanking the targeted exon. 
Subsequent sequencing revealed indels at the expected locations. PAM 
sequences are underlined in red. Of note, the curved lines bridging the gaps in 
deleted alleles are drawn by the alignment software. d, Schematic 
representation of the targeted exon of the human SMAD4 locus and sequences 
of the designed sgRNAs. e, qRT-PCR for AXIN2 (top) and P21 (bottom) in the 
indicated organoid cultures. Top, the indicated organoid lines were cultured in 
the presence (WENR) or absence (EN) of WNT/R-spondin. Bottom, the 
indicated organoid lines were cultured in the presence or absence of nutlin-3 for 
24 h. Expression was normalized to GAPDH. Horizontal bars represent mean of 
n = 3 independent experiments. 
Extended Data Figure 3 | Using CRISPR/Cas9-mediated genome editing to
introduce APC, P53, KRAS^G12D and SMAD4 mutations in human colonic
organoids. a-d, Using the strategies depicted in Figs 1a and 2a, d, APC^KO,
APC^KO/P53^KO (a), KRAS^G12D (b), KRAS^G12D/APC^KO,
KRAS^G12D/APC^KO/P53^KO, KRAS^G12D/APC^KO/P53^KO/SMAD4^KO (c) and
P53^KO (d) mutant human colon organoids were generated. Experiment was
performed at least three independent times for each mutation. e, qRT-PCR for
AXIN2 in the indicated organoid lines cultured in the presence (WENR) or
absence (EN) of WNT/R-spondin. Expression was normalized to GAPDH.
Horizontal bars represent mean of n = 3 independent experiments. f, Western
blot analysis of P53 and P21 expression in the indicated human colon organoid
lines cultured in the presence or absence of nutlin-3. GAPDH, loading control.
g, qRT-PCR for AXIN2 in the indicated organoid lines cultured in the presence
(WENR) or absence (EN) of WNT/R-spondin. Expression was normalized to GAPDH.
Horizontal bars represent mean of n = 3 independent experiments. h, Western
blot analysis of SMAD4 and P53 expression in the indicated human colon
organoid lines. Please note that quadruple-mutant clone 1 contains SMAD4
frameshift-inducing indels in both alleles whereas clone 2 contains a
frameshift-inducing indel in one and an in-frame deletion in the other allele
(reduced SMAD4 expression). GAPDH, loading control. Scale bars, 100 µm.
Extended Data Figure 4 | Quadruple-mutant human intestinal organoids
grow as tumours with features of invasive carcinoma in vivo. a, Wild-type
and all engineered human intestinal organoid lines were injected
subcutaneously in immunodeficient mice. Mice injected with
KRAS^G12D/APC^KO/P53^KO (triple) and KRAS^G12D/APC^KO/P53^KO/SMAD4^KO
(quadruple) organoids developed visible nodules. b, Tumour sizes were
examined 8 weeks after transplantation. c, d, H&E (top left, bottom left),
hKRT (top right, bottom middle) and Ki67 (bottom right) immunostainings on
nodules isolated from triple- (c) and quadruple-mutant (d) injected mice.
Triple-mutant organoids did engraft but remained small, showed only weak
proliferation and had adenoma features (n = 3 mice). Quadruple-mutant-derived
tumours were highly proliferative with features of invasive carcinoma
(n = 13 mice). See Fig. 3 for more details. Scale bars, 100 µm.


©2015 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 5 | Histological analysis of triple- and quadruple-mutant
organoids reveals morphological changes in vitro. a, Representative
H&E and Ki67 immunostainings on the indicated human small intestinal
organoid lines (n = 4 independent experiments). b, Representative H&E
and Ki67 immunostainings on the indicated human colon organoid lines
(n = 3 independent experiments). c, Representative E-cadherin
immunostainings on wild-type and quadruple-mutant human small intestinal
organoids (n = 4 independent experiments). Asterisk indicates residual
Matrigel. Scale bars, 100 µm.
Extended Data Figure 6 | Progressive aneuploidy upon introduction of
CRC mutations. a-d, Karyograms of APC^KO (a), APC^KO/P53^KO
(b), KRAS^G12D/APC^KO/P53^KO (c) and KRAS^G12D/APC^KO/P53^KO/SMAD4^KO
(d) organoids, showing extensive aneuploidy in organoids harbouring CRC
mutations (20 spreads were analysed per line). Note the occurrence of trisomy 7
in APC^KO and APC^KO/P53^KO (independent clones) organoids. M, marker
chromosomes.
Extended Data Figure 7 | Loss of both APC and P53 results in extensive CIN
and aneuploidy. a, APC^KO, P53^KO and APC^KO/P53^KO mutations were
introduced in a second independent human intestinal organoid line. PCR 
amplification products of the mutated alleles of APC and P53 were obtained 
using primers flanking the targeted exon. Subsequent sequencing revealed 
frameshift-inducing indels at the expected locations. Left, APC genotyping; 
right, P53 genotyping. PAM sequences are underlined in red. Of note, the 
curved lines bridging the gaps in deleted alleles are drawn by the alignment 
software. b, Western blot analysis for P53 and P21 expression in the second 
human intestinal organoid line cultured in the presence or absence of nutlin-3. 
GAPDH, loading control. c, RT-PCR for AXIN2 in the second human 
intestinal organoid line cultured in the presence (WENR) or absence (EN) of 
WNT/R-spondin. Expression was normalized to GAPDH. Horizontal bar
represents mean of n = 3 independent experiments. d, Chromosome numbers
were counted in the second human intestinal organoid lines. Graphs plot the 
percentage of cells with chromosome counts <44, 44-48 (normal) and >48 (at 
least 50 spreads were counted). e, As in d, but for indicated human colon 
organoid lines. f, Live-cell imaging was performed to monitor chromosome 
segregations in the indicated human small intestinal organoid lines. Graph 
shows the percentage of erroneous mitoses. Each dot represents the percentage 
of errors in one organoid. Horizontal bars represent median of all dots. A video 
is included of organoids depicted as dots with green outline (Supplementary 
Video 6). WT, wild type; KO, knockout. g, As in f, but for indicated human 
colon organoid lines. h, Western blot analysis of phospho-CHK1 and P53 
expression in the indicated organoid lines treated with the DNA-damaging 
drug doxorubicin, or left untreated. GAPDH, loading control. 
Extended Data Figure 8 | Engineered mutant human colon organoids grow
as invasive carcinomas in vivo. a, Wild-type, triple- and quadruple-mutant
human colon organoids were injected subcutaneously in immunodeficient
mice. Nodules were counted 7 weeks after transplantation. b, Tumour sizes
were examined 7 weeks after transplantation. c, Representative pictures of a
‘cystic’ triple-mutant (left) and ‘solid’ quadruple-mutant (right) tumour in
immunodeficient mice. d, H&E (top left, bottom left), hKRT (top middle,
bottom middle) and Ki67 (top right, bottom right) immunostainings on
nodules isolated from triple-mutant-injected mice. Representative pictures of a
well-differentiated carcinoma with limited invasive growth. The invasive
growth has an expansive growth pattern with little tumour budding. n = 6
mice. e, As in d but for quadruple-mutant-derived tumours. Representative
pictures of a poorly differentiated invasive carcinoma with frequent tumour
budding at the invasive front (invasion of isolated or small aggregates of cells
into the stroma is frequently observed (black arrowheads)). Invasive character
is confirmed by the invasive growth into the underlying muscle tissue (asterisk,
muscle tissue). n = 8 mice. Scale bars, 100 µm.
Extended Data Table 1 | Introducing oncogenic mutations in human intestinal organoids using CRISPR/Cas9

a
Gene targeted | Selection procedure | # of functional sgRNAs
APC | - Wnt - R-spondin | 2 out of 4
P53 | + Nutlin-3 | 3 out of 4
KRAS | - EGF (+ gefitinib) | 2 out of 2
SMAD4 | - Noggin | 2 out of 4

b
Organ | Organoid line | Mutation order | Genotype | In vitro phenotype | In vivo tumorigenicity
Small intestine | Wild-type | - | | -; stable genome | No
Small intestine | APC^KO (sgRNA 4) | APC | APC FS_pro1443/FS_pro1443 | - Wnt - R-spondin; mild aneuploidy | No
Small intestine | APC^KO/P53^KO (sgRNA 4/3) | APC>P53 | APC FS_pro1443/FS_pro1443; P53 FS_phe109/FS_phe109 | - Wnt - R-spondin + Nut-3; extensive aneuploidy |
Small intestine | KRAS^G12D (sgRNA 2) | KRAS | KRAS c.35G>A/FS_gly13 | - EGF + gefitinib; N.D. |
Small intestine | KRAS^G12D/APC^KO (sgRNA 2/4) | KRAS>APC | KRAS c.35G>A/FS_gly13; APC FS_pro1443/FS_pro1443 | - EGF - Wnt - R-spondin; N.D. |
Small intestine | KRAS^G12D/APC^KO/P53^KO (sgRNA 2/4/3) | KRAS>APC, P53 | KRAS c.35G>A/FS_gly13; APC FS_pro1443/FS_pro1443; P53 FS_phe109/FS_phe109 | - EGF - Wnt - R-spondin + Nut-3; extensive aneuploidy | Adenoma
Small intestine | KRAS^G12D/APC^KO/P53^KO/SMAD4^KO (sgRNA 2/4/3/1) | KRAS>APC, P53, SMAD4 | KRAS c.35G>A/FS_gly13; APC FS_pro1443/FS_pro1443; P53 FS_phe109/FS_phe109; SMAD4 DEL_pro356/FS_pro356 | - EGF - Wnt - R-spondin - Noggin + Nut-3; extensive aneuploidy; grow as solid tumour masses | Invasive carcinoma
Colon | Wild-type | | | -; stable genome |
Colon | APC^KO (sgRNA 4) | APC | APC FS_pro1443/FS_pro1443 | - Wnt - R-spondin; mild aneuploidy |
Colon | P53^KO (sgRNA 3) | P53 | P53 FS_phe109/FS_phe109 | + Nut-3; mild aneuploidy | N.D.
Colon | APC^KO/P53^KO (sgRNA 4/3) | APC>P53 | APC FS_pro1443/FS_pro1443; P53 FS_gly108/FS_gly108 | - Wnt - R-spondin + Nut-3; extensive aneuploidy | N.D.
Colon | KRAS^G12D (sgRNA 2) | KRAS | KRAS c.35G>A/c.35G>A | - EGF + gefitinib; N.D. | N.D.
Colon | KRAS^G12D/APC^KO (sgRNA 2/4) | KRAS>APC | KRAS c.35G>A/c.35G>A; APC FS_pro1443/FS_pro1441 | - EGF - Wnt - R-spondin; N.D. | N.D.
Colon | KRAS^G12D/APC^KO/P53^KO (sgRNA 2/4/3) | KRAS>APC, P53 | KRAS c.35G>A/c.35G>A; APC FS_pro1443/FS_pro1443; P53 FS_phe109/FS_phe109 | - EGF - Wnt - R-spondin + Nut-3; extensive aneuploidy | Well-differentiated carcinoma with limited invasive growth
Colon | KRAS^G12D/APC^KO/P53^KO/SMAD4^KO (sgRNA 2/4/3/1) | KRAS>APC, P53, SMAD4 | KRAS c.35G>A/c.35G>A; APC FS_pro1443/FS_pro1443; P53 FS_gly105/FS_gly105; SMAD4 DEL_pro356/FS_pro356 | - EGF - Wnt - R-spondin - Noggin + Nut-3; extensive aneuploidy; grow as solid tumour masses | Poorly differentiated carcinoma with frequent invasive growth

a, Overview of the number of functional sgRNAs and the selection strategy used. b, Overview of the engineered lines. N.D., not determined.
Structural basis for Na+ transport
mechanism by a light-driven Na+ pump


Hideaki E. Kato, Keiichi Inoue, Rei Abe-Yoshizumi, Yoshitaka Kato, Hikaru Ono, Masae Konno, Shoko Hososhima,
Toru Ishizuka, Mohammad Razuanul Hoque, Hirofumi Kunitomo, Jumpei Ito, Susumu Yoshizawa, Keitaro Yamashita,
Mizuki Takemoto, Tomohiro Nishizawa, Reiya Taniguchi, Kazuhiro Kogure, Andrés D. Maturana, Yuichi Iino,
Hiromu Yawo, Ryuichiro Ishitani, Hideki Kandori & Osamu Nureki


Krokinobacter eikastus rhodopsin 2 (KR2) is the first light-driven Na+ pump discovered, and is viewed as a potential
next-generation optogenetics tool. Since the positively charged Schiff base proton, located within the ion-conducting
pathway of all light-driven ion pumps, was thought to prohibit the transport of a non-proton cation, the discovery of
KR2 raised the question of how it achieves Na+ transport. Here we present crystal structures of KR2 under neutral and
acidic conditions, which represent the resting and M-like intermediate states, respectively. Structural and spectroscopic
analyses revealed the gating mechanism, whereby the flipping of Asp116 sequesters the Schiff base proton from the
conducting pathway to facilitate Na+ transport. Together with the structure-based engineering of the first light-driven
K+ pumps, electrophysiological assays in mammalian neurons and behavioural assays in a nematode, our studies reveal
the molecular basis for light-driven non-proton cation pumps and thus provide a framework that may advance the
development of next-generation optogenetics.


Many organisms capture light energy and information using the rho- 
dopsin family of proteins, which comprise the heptahelical trans- 
membrane (7-TM) proteins called opsins covalently linked to 
retinal. Based on their primary sequences, the opsin genes are clas- 
sified into two groups: the microbial and animal opsins. The animal 
rhodopsins primarily work as G-protein-coupled receptors, whereas 
the microbial rhodopsins have divergent functions, such as ion 
pumps, ion channels, sensors and kinases. Recently, the pump-
and channel-type rhodopsins have attracted broad attention, since 
these microbial rhodopsins can be used as powerful tools in the neu- 
roscience field to control neuronal activity in a wide range of living 
animals (optogenetics).

As compared to the light-gated ion channel channelrhodopsin
(ChR), the light-driven ion pumps have a long research history.
Since the discoveries of the light-driven proton pump bacteriorhodopsin
(BR) and the light-driven chloride pump halorhodopsin (HR),
several light-driven ion pumps, such as proteorhodopsins (PRs),
xanthorhodopsin (XR), and archaerhodopsins (ARs), have been cloned
and studied in diverse research fields, including optogenetics.
However, these light-driven ion pumps were classified into
only two groups, outward proton pumps and inward chloride pumps,
and no non-proton cation pumps had been discovered. Almost all of
the known microbial rhodopsins are covalently bound to all-trans
retinal (ATR) via the protonated Schiff base in the resting state, and
the positively charged Schiff base proton in the middle of the ion
transport pathway prevents cation transport (Extended Data Fig. 1).
In the case of proton pumps, the Schiff base proton itself works as the
substrate, and retinal photoisomerization alters the pKa values of
the Schiff base and the carboxylates located on the extracellular side
(historically called the ‘Schiff base counterions’), consequently leading
to the proton transfer from the Schiff base to the extracellular side
(Extended Data Fig. 1a). In the case of chloride pumps, Cl- binding
stabilizes the protonated Schiff base and retinal photoisomerization
flips the N-H dipole, thus driving the movement of Cl- from the
extracellular environment to the intracellular side (Extended Data
Fig. 1b). Therefore, the absence of light-driven non-proton cation
pumps was widely considered reasonable, because a non-proton cation
would experience electrostatic repulsion from the Schiff base proton
(Extended Data Fig. 1c).

However, in 2013, a new microbial rhodopsin, containing the
unique NDQ (Asn112, Asp116 and Gln123) motif, was cloned from
the marine flavobacterium Krokinobacter eikastus, and was characterized
as the first light-driven Na+ pump. Although the new rhodopsin,
Krokinobacter eikastus rhodopsin 2 (KR2), transports protons in
the absence of Na+ and Li+, under physiological conditions it works
solely as an outward Na+ pump. A previous study revealed its cyclic
photochemical reaction (photocycle) involving four spectroscopically
distinguishable intermediates (K, L, M and O) and identified some of
the functionally important residues, but the mechanism of Na+
transport remained elusive. Understanding the structural basis of
non-proton cation transport by KR2 would be enormously valuable, not
only to enhance our knowledge of rhodopsin family proteins, but also
to facilitate the design of novel light-driven ion pumps with desired ion
selectivity, such as K+ and Ca2+ pumps. Such ion pumps, as well as
Na+ pumps, would be useful as next-generation optogenetics tools.


Overall structure and comparison with BR 

To understand the structural basis for Na+ transport, we expressed,
purified and crystallized KR2 lacking the five carboxy-terminal amino
acid residues (residues 1-275). The crystals were obtained by the lipidic
cubic phase (LCP) method under low pH conditions (pH 4.0). The
structure was determined by the molecular replacement method, using
the coordinates of XR (PDB ID: 3DDL), and refined to 2.3 Å resolution
(Extended Data Fig. 2a). The crystals belonged to the I222 space
group, and contained one KR2 protomer in the asymmetric unit
(Extended Data Fig. 2b). Except for the four disordered amino-terminal
residues (residues 1-4), KR2 (residues 5-275), ATR, three lipids, one
polyethylene glycol (PEG), and several water molecules are clearly
resolved in the electron density map (Extended Data Fig. 2c).

Figure 1 | Overall structure of KR2 and comparison with BR. Crystal
structure of KR2, viewed parallel to the membrane (left), and from the
extracellular side (right). KR2 consists of the N-helix and the seven
transmembrane helices (TM1-TM7) connected by extracellular loops
(ECL1-ECL3) and intracellular loops (ICL1-ICL3). ATR, depicted by stick
models, is coloured blue, and ECL1 is coloured green.

The structure of KR2 is composed of an extracellular N-terminal 
helix (N-helix; residues 10-17) and 7-TM domains (residues 21-268), 
connected by three intracellular loops (ICL1 to ICL3) and three extra- 
cellular loops (ECL1 to ECL3) (Fig. 1). To understand the structure- 
function relationship of KR2 in detail, we compared the structure of 
KR2 with that of the best-characterized light-driven proton pump, BR 
(PDB ID: 1C3W). Consistent with the relatively high sequence similarity
(46%) (Extended Data Fig. 3), the overall structure of KR2
superimposed well on that of BR (root mean square deviation
(r.m.s.d.) of 1.83 Å over all Cα atoms) (Extended Data Fig. 4).
However, KR2 has two distinct features on its extracellular side: the 
unique conformation of ECL1 and the presence of a short N-helix. 
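
For readers unfamiliar with the r.m.s.d. figure quoted above, the following is a minimal sketch of how such a value is computed once two structures have been superimposed and their Cα atoms paired. It is not the authors' procedure: the superposition step is assumed to have been done already, and the coordinates below are made-up toy values rather than KR2 or BR data.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between paired (x, y, z) coordinates.

    Assumes both lists are the same length, the atoms are paired in order,
    and the two structures have already been superimposed.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared = sum(
        (ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2
        for (ax, ay, az), (bx, by, bz) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


# Toy example: every atom displaced by exactly 1 Å along z
structure_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
structure_b = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0), (2.0, 0.0, 1.0)]
toy_rmsd = rmsd(structure_a, structure_b)
```

Because each toy atom pair is separated by the same 1 Å, the r.m.s.d. here equals that displacement.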


ECL1 and N-helix 


In KR2, ECL1 (residues 86-106), forming the antiparallel β-sheet, has
an entirely different orientation as compared to that in BR (Figs 1, 2a-c
and Extended Data Fig. 4a). It protrudes into the extracellular side,
and its conformation is rather similar to that in XR (Extended Data
Fig. 4b). Notably, the four carboxylates (Glu90, Glu91, Asp98 and
Asp102) on ECL1 are positioned separately and oriented towards
the bulk solvent, although they are considered to contribute to the
extracellular Na+-binding site, and to compose part of the exit pathway
for Na+ conduction (Fig. 2b). To analyse the functions of these
carboxylates in more detail, we first measured the level of Na+ binding
to the E90Q/E91Q, D98N, and D102N mutants, using attenuated
total reflection Fourier-transform infrared (atrFTIR) spectroscopy. In
this experiment, we monitored the infrared spectral changes upon
Na+ binding, and the difference spectra clearly showed that among
the four carboxylates, only Asp102 contributes to the extracellular
Na+-binding site (Extended Data Fig. 5a). We next examined the
functions of the D102N mutant, and found that it has reduced
thermostability, although its activity and kinetics are similar to those of
wild-type KR2 (Fig. 2d, e and Extended Data Fig. 5b-e). These unexpected
results suggested that only Asp102, among the four carboxylates,
forms the extracellular Na+-binding site, and this site does not
function as part of the Na+ conducting pathway, but contributes to
the thermostability of KR2.
Figure 2 | Structures and functions of the N-helix and ECL1. a-c, Overall
architecture of KR2 (a), with magnified views of the green (b) and purple
(c) highlighted regions. The black dashed lines and the red sphere represent
hydrogen bonds and a water molecule, respectively. d, Pump activities of
wild-type (WT) KR2 and mutants in E. coli cells. The activities were evaluated by
measuring the pH changes caused by the proton motive force generated upon
Na+ transport by KR2, without (light blue) and with (dark blue) the
protonophore carbonyl cyanide m-chlorophenylhydrazone (CCCP), as
previously reported. Each tick mark of the y axis represents 0.2 pH change.
Grey shading represents the dark condition; white indicates illumination.
e, Thermostability assays of detergent-solubilized wild-type KR2 and mutant
proteins. a.u., arbitrary units.


The second novel feature is that KR2 has an additional short
N-helix, which is located at a similar position to ECL1 in BR
(Fig. 2c and Extended Data Fig. 4a). This helix interacts with ECL1
(Extended Data Fig. 5f) and tethers it to the protein core through
direct and water-mediated hydrogen bonds among Glu11 on the
N-helix, Glu160 on TM4, and Arg243 (Glu204 in BR) on TM7
(Fig. 2c). While this short helix has not been found in other microbial
rhodopsins, it is well conserved among the putative Na+-pump
rhodopsins containing the NDQ motif (Extended Data Fig. 3). Thus, we
analysed the E11A, E160A, and R243A mutant proteins, as well
as the N-helix-truncated (Δ1-18) mutant. We successfully expressed
and purified E11A, E160A, and R243A, but not Δ1-18 (Extended
Data Fig. 5e), and the thermostability assay revealed that these three
mutant proteins were all quite unstable, as compared to the wild-type
protein (Fig. 2e and Extended Data Fig. 5g). However, in contrast to
their instabilities in detergent, all three mutants showed slightly
decreased, but significant, pump activities in Escherichia coli cell
membranes (Fig. 2d). These results indicate that the N-helix and
the interaction between the N-helix and the protein core have important
roles in the structural stability of KR2, rather than its Na+-pump
activity. This is in contrast to BR, in which Glu204, the residue
corresponding to Arg243 in KR2, has an important role in proton release
to the extracellular side.


Intracellular cavity 


As compared to the extracellular region, the intracellular regions of BR
and KR2 are similar, except for minor structural differences in the ICLs
and the C-termini (Extended Data Fig. 4a). Notably, KR2 has a narrow
opening between the intracellular ends of transmembrane domains 1
and 7. This hydrophilic cavity, formed by Tyr45, Phe46, Thr49, Ser60,
Asn61 and Gly263, is filled with water molecules and leads to Gln123
(Fig. 3a-c). Since Gln123 is highly conserved among the putative
Na+-pump rhodopsins (Extended Data Fig. 3), and the corresponding residue
in BR, Asp96, is known to work as the substrate (proton) donor for
the deprotonated Schiff base in the late intermediate, it was assumed
that Gln123 is also involved in substrate (Na+) binding in KR2. The
flash photolysis experiment with the Q123A mutant revealed that the
lifetimes of the late intermediates in the photocycle of the Q123A
mutant (τM = 5.5 ms, τO = 22 ms) were more than three times longer
than the reported lifetimes of wild type (τM = 1.0 ms, τO = 7.9 ms)
(Fig. 3d-f). This result suggested that Gln123 captures Na+ from the
bulk solvent and contributes to the formation of the transient
intracellular Na+-binding site, and is consistent with the previous study
showing that the Q123A mutant has significantly decreased pump activity.

Figure 3 | The intracellular cavity and the retinal binding pocket. a, Overall
architecture of KR2, depicted by a surface representation. The region around
the intracellular cavity is highlighted in the orange rectangle. b, c, Magnified
views of the highlighted region in a, with (c) and without (b) the surface
representation. The water molecules are shown as red spheres. d-f, Photocycle
of the Q123A mutant. d, e, Transient absorption spectra of the Q123A mutant
(d) and time traces of the absorption changes (e) at specific probe wavelengths.
f, Photocycle scheme for the Na+ pump, determined from the analysis of the
results shown in d and e.


Schiff base region in two states


We next focused on the retinal binding pocket and the region around the
Schiff base. In contrast to the relatively conserved retinal binding pocket
(Supplementary Discussion and Extended Data Fig. 8), remarkable
differences exist in the Schiff base regions between the structures of BR and
KR2 (Fig. 4). In BR in the resting state, the protonated Schiff base points
towards the extracellular side and hydrogen bonds with a key water
molecule between the Schiff base counterions, Asp212 and Asp85.
Asp212 is fixed by hydrogen bonds with Tyr57 and Tyr185, while
Asp85, which works as the proton acceptor from the Schiff base in the
M intermediate, interacts with Thr89 (Fig. 4a). In contrast, Asp85 and
Thr89 are replaced by Asn112 and Asp116, respectively, in KR2, and the
substitution of Tyr57 with Leu75 results in the loss of the hydrogen bond
with Asp251 (Asp212 in BR). The key water molecule between Asn112
and Asp251 is located near Asp251 and Arg109 (Arg82 in BR), and is no
longer hydrogen bonded with the Schiff base and Asn112 (Fig. 4b)
(Supplementary Discussion and Extended Data Fig. 9). The orientation
of the Schiff base is different from that in BR, and in KR2 it points
towards the area between Asp116 and Asn112, rather than towards the
water (Fig. 4b and Extended Data Fig. 4c, d). We focused on Asp116,
which is oriented away from the Schiff base and forms hydrogen bonds
with Asn112 and Ser70 (Ile52 in BR). The estimated pKa value of
Asp116, calculated using PROPKA, was about 7.0, suggesting that
Asp116 may be protonated in the crystal. To analyse the protonation
state of Asp116 in more detail, we measured the absorption spectra of
wild-type KR2 and D-to-N mutants of the titratable residues around the
Schiff base, Asp116 and Asp251. We measured their spectra at pH 4.0
(the crystallization condition) and 7.0 (the physiological condition), and
found that the peak wavelength (λmax) of the wild-type spectrum at
pH 4.0 was red-shifted (536 nm), as compared to that at pH 7.0
(526 nm). This result suggested that Asp116 and/or Asp251 are
deprotonated at pH 7.0, but protonated at pH 4.0. A similar red-shift was
observed in the D251N mutant (526 nm at pH 7.0, and 540 nm at
pH 4.0), but the λmax of the D116N mutant was already significantly
red-shifted at pH 7.0 (558 nm) (Fig. 4d). These results strongly suggested
that Asp116 is protonated in our structure under acidic conditions, and
is deprotonated in the resting state under neutral conditions. Since the
previous spectroscopic analysis revealed that Asp116 functioned as a
proton acceptor in the M intermediate, it was assumed that our structure
determined under acidic conditions had a similar conformation to
that of the M intermediate, in which Asp116 was protonated. This notion
is in good agreement with the previous BR study, in which the authors
demonstrated that the acid-induced structural change of BR resembled
the change observed upon the formation of the M intermediate.
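
The link between the estimated pKa of about 7.0 and the expectation that Asp116 is protonated at the crystallization pH can be illustrated with the Henderson-Hasselbalch relation. This is a sketch under a strong simplifying assumption, namely that Asp116 behaves as a single, independent titratable group; the function name is hypothetical and the calculation is not part of the authors' analysis.

```python
def fraction_protonated(ph: float, pka: float) -> float:
    """Fraction of an ideal, isolated acid group that is protonated.

    From Henderson-Hasselbalch: [HA] / ([HA] + [A-]) = 1 / (1 + 10**(pH - pKa)).
    """
    return 1.0 / (1.0 + 10 ** (ph - pka))


PKA_ASP116 = 7.0  # estimate quoted in the text (PROPKA)

at_ph4 = fraction_protonated(4.0, PKA_ASP116)  # crystallization condition
at_ph7 = fraction_protonated(7.0, PKA_ASP116)  # physiological condition

print(f"pH 4.0: {at_ph4:.3f} protonated")
print(f"pH 7.0: {at_ph7:.3f} protonated")
```

Under this idealization the group is almost fully protonated at pH 4.0, whereas at pH 7.0 the estimate alone is uninformative (half protonated), which is in line with the authors turning to absorption spectra to settle the protonation state.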

To obtain structural information about KR2 under neutral conditions,
we soaked the KR2 crystals at pH 7.5-8.5 for 48 h. The soaking led to the
clear colour change of the crystal from purple to red (Fig. 4e), and thus
we surmised that this soaking experiment changed the pH in the crystal
and deprotonated Asp116. Using one of these red crystals, we determined
the crystal structure of KR2 under neutral conditions, at 2.3 Å
resolution. While the overall architectures of KR2 under both pH
conditions were essentially identical (r.m.s.d. of 0.22 Å over all Cα atoms),
there were some notable differences around the Schiff base (Fig. 4c).
Under acidic conditions, the water molecule between Asn112 and
Asp251 is located closer to Asp251 and Arg109, and these two residues
interact by a water-mediated hydrogen bond. However, under the neutral
conditions, the distance between the water and Asn112 was reduced
from 5.1 to 3.1 Å, and thus Arg109 forms a direct hydrogen bond with
Asp251. Since Asp251 and Arg109 are reportedly highly conserved and
required for the Na+-pump activity, these structural differences suggest
that the rearrangement of the water upon the M intermediate formation
modulates the interaction between Asp251 and Arg109.

Consistent with the pH-dependent peak shift of the absorption spectra
observed in the purified proteins and crystals (Fig. 4d, e), the key
difference was found in Asp116. In contrast to the observations made
under acidic conditions, the electron density for the side chain of
Asp116 indicated the coexistence of two alternative rotamers, one
pointing towards Ser70 and Asn112 (rotamer 1), and the other pointing
towards the Schiff base nitrogen (rotamer 2). Rotamers 1 and 2 represent
the protonated and deprotonated forms of Asp116, respectively, and
on the basis of the occupancy refinement, we estimated a 65:35 mixture
(Extended Data Fig. 6). These results suggest that the deprotonated
Asp116 is oriented towards the protonated Schiff base in the resting
state (rotamer 2 in Fig. 4c), and Asp116 changes its conformation and
hydrogen bonds with Asn112 and Ser70 after it accepts a proton from
the Schiff base in the M intermediate (Fig. 4b and rotamer 1 in Fig. 4c).
Since the proton transfer to Asp116 and its conformational change
move the positively charged proton away from the Schiff base, these
two events probably sequester the proton from the ion-conducting
pathway and facilitate Na+ transport. A previous study reported that
the D116N mutant completely abolished Na+ transport activity and
implied that the proton transfer step was a prerequisite to Na+ transport
activity; we therefore analysed whether the conformational change of the
protonated Asp116 was functionally important. We measured the Na+
transport activities of the S70T, S70A, N112A, and N112D mutants and
found that while the S70T and N112D mutants still retained Na+
transport activities, the S70A and N112A mutants lost Na+ transport
activity (Fig. 4f). These results indicate that the hydrogen bonds among
Asp116, Ser70 and Asn112, which fix the conformation of the protonated
Asp116, are essential for Na+ transport.

Figure 4 | The Schiff base regions under acidic and neutral conditions.
a-c, Stereo views of the Schiff base regions of BR (a) and KR2 under acidic
(b) and neutral (c) conditions. Red spheres and black dashed lines represent
water molecules and hydrogen bonds, respectively. Grey dashed lines and
numbers indicate the distances between Asn112 and the water in Å. d, The
absorption spectra of wild-type (WT) KR2 and mutants. e, Photographs of the
same KR2 crystal before (left) and after (right) the soaking experiment. f, Pump
activities of wild-type KR2 and mutants in E. coli cells, without (light blue) and
with (dark blue) CCCP. Each tick mark of the y axis represents 0.2 pH change.


Working model of Na+ transport

Based on these results, we propose a KR2 transport model (Fig. 5). In the
resting state, a proton is bound to the Schiff base nitrogen, and this
positive charge is located in the centre of the ion transport pathway
(Fig. 5a), which prevents cation transport. After the retinal isomerization
and the N-H flipping, the proton is transferred from the Schiff base
to Asp116. This event would occur during the transition from the K to
M intermediate (Fig. 5b). In the M intermediate, the protonated Asp116
flips away from the Schiff base and forms hydrogen bonds with Ser70
and Asn112. This flipping moves the positively charged proton away
from the centre of the ion transport pathway, reduces the energy barrier
to Na+ transport, and thereby facilitates Na+ conduction (Fig. 5c). The
proton would re-associate with the Schiff base in the O intermediate
(the next intermediate after the M intermediate), and would thus
prevent the reverse flow of Na+ (Fig. 5d, Supplementary Discussion and
Extended Data Fig. 10).


KR2 application to optogenetics 


Like HR and AR3, KR2 is also expected to work as an inhibitory 
optogenetics tool. To test this possibility, we introduced a human- 
codon-optimized KR2 gene, fused with a membrane trafficking signal, 
enhanced yellow fluorescent protein (eYFP), and an endoplasmic 
reticulum export signal, into cultured rat cortical neurons. The 
eYFP fluorescence was visible in neural cell bodies and membranous 
processes (Extended Data Fig. 7a). Voltage-clamp recordings showed 
that exposure to green light evoked a robust outward photocurrent, 
which was independent of the voltage (Fig. 6a, b). Next, we analysed 
whether the KR2-mediated photocurrent could inhibit neuronal spikes. 
Under current-clamp mode conditions, we injected a rectangular 
depolarizing current large enough to evoke repetitive spikes. As shown 
in Fig. 6c and Extended Data Fig. 7b, the generation of an action 
potential was completely blocked, while the membrane potential was 
hyperpolarized by KR2 activation with green light. The KR2-mediated 
inhibition of spiking was quite stable, and remained effective for more 
than one minute (Extended Data Fig. 7b). 

Figure 5 | Proposed model of Na⁺ transport. The water, proton and Na⁺ are 
depicted by the red, green and orange spheres, respectively. a, In the resting 
state, the positive charge of the Schiff base proton blocks Na⁺ transport. 
b, After retinal isomerization and N-H flipping, the Schiff base proton is 
transferred to Asp116. c, In the M intermediate, the protonated Asp116 flips 
away and forms hydrogen bonds with Ser70 and Asn112. The flipping of 
Asp116 reduces the energy barrier to Na⁺ transport, and thereby facilitates 
Na⁺ conduction. d, The re-protonation of the Schiff base would inhibit the 
reverse flow of Na⁺. 

To examine whether KR2 could be applied to controlling neural 
activities in vivo, a KR2-Venus fusion protein was expressed in 
Caenorhabditis elegans neurons, using a pan-neuronal promoter. The 
Venus fluorescence was visible in neural cell bodies and membranous 
processes throughout the nervous system (Extended Data Fig. 7a). 
To test the effects of KR2 activation on neural activity, the velocity 
of forward locomotion was measured. Illuminating transgenic 
worms with green light in the presence of ATR specifically slowed 
down the worms’ locomotion (Fig. 6d), suggesting that the photoacti- 
vation of KR2 decreased the neural activity under the experimental 
conditions. 

Based on these results, we propose that KR2 could indeed work as 
an inhibitory optogenetics tool in heterologous systems. To our 
knowledge, this is the first example of the successful application of a 
eubacterial rhodopsin to optogenetics. Unlike HR and AR3, KR2 is not 
expected to induce unintentional side effects, such as the activation 
of pH-sensitive channels. The outward flow of Na⁺ would be more 
physiological than that of H⁺ or Cl⁻. Moreover, the voltage-independent 
photocurrent of KR2 would allow efficient and reliable Na⁺ transport 
even under highly negative membrane potential conditions. 
Therefore, KR2 appears to have the potential to become an ideal 
inhibitory tool in some optogenetics experiments (Supplementary 
Discussion). 


Engineering of K⁺-pumping KR2 variants 

In the field of optogenetics, many variants of light-driven ion trans- 
porters have been engineered to expand the scope of their application. 
One of the most successful examples is the engineering of ChR 
variants with altered ion selectivity. Thus, we next tried to engineer a 
KR2 variant with altered ion selectivity. 

Figure 6 | Optogenetic applications and molecular engineering of KR2. 
a-c, Electrophysiology in cortical neurons. a, Voltage-clamp recording of the 
KR2 photocurrent. b, The current-voltage relationship. Values are 
means ± s.e.m. of 10 experiments. c, Current-clamp recording. Light 
illumination (green bar) inhibited the generation of the action potential. 
RMP, resting membrane potential. d, Effect of KR2 activation on C. elegans 
locomotion behaviour. The plot shows the change in the motion speed during 
and after light illumination (mean ± s.e.m., n = 9-10). e, Transverse section 
from the intracellular side. f, Pump activities of wild-type (WT) KR2 and 
mutants in E. coli cells suspended in 100 mM NaCl (top) or KCl (bottom), 
without (light blue) and with (dark blue) CCCP. Each tick mark of the y axis 
represents 0.5 pH change. g, Ion selectivities of wild-type KR2 and mutants. 
Values are means and standard deviation (s.d.) of three experiments. ND, not 
detected. 

In our gating model of KR2, the flipping of Asp116 and the proton 
sequestration do not actively select Na⁺, but only facilitate the transfer 
of a non-proton cation. Thus, we assumed that it would be possible to 
engineer KR2 variants that preferentially transfer another cation, 
especially K⁺. Since the extracellular pathway of the ion conduction 
was still ambiguous, we focused on two residues, Asn61 and Gly263, 
forming the intracellular vestibule (Figs 3b, c and 6e). We expressed 
several mutants of these two residues in E. coli, and found that some 
mutants showed outward K⁺ currents (Fig. 6e-g). Surprisingly, the 
double mutation N61P/G263W further increased the K⁺ conduction, 
and this mutant (KR2_K⁺) preferentially transferred K⁺ over Na⁺ 
(Fig. 6e-g). Although the K⁺/Na⁺ selectivity ratio is still not very high 
(1.17 ± 0.15), the intracellular K⁺ concentration is ten times higher 
than that of Na⁺, so KR2_K⁺ is expected to work more efficiently as a K⁺ 
pump in vivo. Since no isolated or engineered light-driven K⁺ pump 
has been reported so far, KR2_K⁺ provides a useful platform for the 
further engineering of KR2 variants to expand their applications. 
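
The expectation that a modest K⁺/Na⁺ selectivity still gives a predominantly K⁺-transporting pump in vivo can be illustrated with a rough estimate. The sketch below assumes, purely for illustration, that the relative transport of each ion scales with the selectivity ratio multiplied by its intracellular concentration. The function name and this proportionality are assumptions and are not part of the reported analysis.

```python
def expected_k_fraction(selectivity_k_over_na: float,
                        conc_ratio_k_over_na: float) -> float:
    """Fraction of transported cations expected to be K+ (illustrative model).

    Assumes transport of each ion is proportional to
    selectivity x intracellular concentration (an assumption).
    """
    # Relative weight of K+ transport; the Na+ weight is normalized to 1.
    k_weight = selectivity_k_over_na * conc_ratio_k_over_na
    return k_weight / (k_weight + 1.0)


# Values quoted in the text: selectivity ratio 1.17, and an intracellular
# K+ concentration about ten times that of Na+.
fraction = expected_k_fraction(1.17, 10.0)
print(f"Expected K+ share of transport: {fraction:.2f}")
```

Under this simple model, with equal concentrations the same selectivity ratio would give only a slight preference for K⁺, which is why the concentration difference matters for the in vivo expectation.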


Conclusion 


The present crystal structures of KR2 and the accompanying func- 
tional analyses revealed the structural basis by which this microbial 
rhodopsin acquired Na⁺-pump activity, and enabled us to engineer 
the first light-driven K⁺ pump. Moreover, the electrophysiological 
and behavioural assays in mammalian neurons and nematodes 
showed the potential of KR2 as a new inhibitory optogenetics tool. 

To fully understand the mechanism of Na⁺ transport by KR2, and 
to apply KR2 in several fields of neuroscience, additional studies are 
clearly needed. However, this study provides a valuable framework for 
the further development of optogenetics technology, as well as the 
basic mechanistic understanding of this new class of rhodopsin family 
proteins. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 

Cloning, protein expression and purification. The kr2 gene (GI: 478430804) 
was cloned from K. eikastus (NBRC 100814ᵀ) and subcloned into a 
pET-modified vector, with the resulting construct encoding a TEV cleavage 
site followed by a His₈ tag at the C terminus. For biochemical and 
spectroscopic experiments (pump activity assay, atrFTIR measurement, flash 
photolysis, quantification of KR2 expression in E. coli, thermostability 
assay), we mainly used a synthesized KR2 gene with codons optimized for 
E. coli expression. The protein was expressed in C41 (DE3) E. coli cells, 
induced with 0.5 mM isopropyl β-D-thiogalactopyranoside (IPTG) and 
supplemented with 10 μM ATR (Sigma), for 3 h at 37 °C. The cells were 
disrupted and the membrane fraction was collected by ultracentrifugation 
(125,000g, 1 h). The protein was solubilized with 1.5% 
n-dodecyl-β-D-maltoside (DDM), in buffer containing 50 mM Tris-HCl 
(pH 8.0), 300 mM NaCl, 20 mM imidazole, 0.1 mM phenylmethylsulfonyl 
fluoride (PMSF), and 0.3% cholesteryl hemisuccinate (CHS). After Ni-NTA 
affinity chromatography, the collected fractions were treated with TEV 
protease and dialysed against buffer A (50 mM Tris-HCl, pH 8.0, 150 mM 
NaCl, 0.02% DDM, 0.004% CHS). The cleaved His₈ tag and TEV protease were 
removed with Ni-NTA resin, and the protein was purified further by 
gel-filtration chromatography in the presence of 50 mM Tris-HCl (pH 8.0), 
0.05% DDM, and 0.01% CHS. The purified protein was concentrated to 
20 mg ml⁻¹ using a centrifugal filter device (Millipore, 100 kDa 
molecular-weight cutoff), and frozen until crystallization. 

Crystallization. The purified KR2 protein was mixed with monoolein (1-oleoyl- 
rac-glycerol) (Nu-Chek) in a 2:3 protein to lipid ratio (w/w). Aliquots (50-100 nl) 
of the protein-LCP mixture were spotted on a 96-well sandwich plate and over- 
laid by 800-1,000 nl of precipitant solution by the crystallization robot, mosquito 
LCP (TTP LabTech). Crystals of KR2 were initially obtained in the MemMeso 
crystallization screen (Molecular Dimensions), and optimized with 42% PEG 
200, 0.1 M sodium acetate, pH 4.0, and 0.05 M MgCl₂. The crystals were harvested 
directly from the LCP matrix using MiTeGen micromounts, and were flash- 
cooled in liquid nitrogen. The KR2 crystals under neutral conditions were pre- 
pared by adding soaking buffer (42% PEG 200, 0.1 M Tris-HCl, pH 8.0-9.0, and 
0.05 M MgCl₂) into the well of the plate containing KR2 crystals at pH 4.0. The 
final pH was 7.5-8.5. 

Data collection and structure determination. X-ray diffraction data sets were 
collected on the BL32XU beamline at SPring-8, using a 10-μm-wide, 10-μm-high 
microbeam. The diffraction data were processed with the XDS programs, and 
the structure was solved by molecular replacement, using the structure of 
xanthorhodopsin (PDB ID: 3DDL) as a search model with Molrep in the 
CCP4 suite. The resultant structure was iteratively refined in Refmac5 and 
Phenix, and manually rebuilt in Coot. In the Ramachandran plot, 97.0% of 
residues were in favoured regions and 2.6% of residues were in allowed regions. 
Final refinement statistics are summarized in Extended Data Fig. 2a. All figures 
were prepared with Cuemol (http://www.cuemol.org). 

Measurement of Na⁺-pump activity in E. coli. E. coli cells expressing KR2 were 
cultured as described previously. The cells were collected by centrifugation 
(4,800g, 3 min), washed three times with 100 mM NaCl, and re-suspended in 
the same solution for pump activity measurements. For the K⁺ transport assay, 
the cells were washed and suspended in 100 mM KCl in a similar manner. The 
cell suspension (7.5 ml) was placed in darkness and then illuminated using the 
output of a 1-kW tungsten-halogen projector lamp (Master HILUX-HR, 
Rikagaku, Japan) through a glass filter (Y-52, AGC Techno Glass, Japan) for 
2.5 min at wavelengths >500 nm. The light-induced pH changes during the 
incubation were monitored with a pH meter (F-55, Horiba, Japan). 
Measurements were repeated under the same conditions after the addition of 
10 μM CCCP. 

atrFTIR measurement. Na⁺ binding to wild-type KR2 and mutants was studied 
by atrFTIR. Before the measurement, KR2 was reconstituted into 
1,2-dioleoyl-sn-glycero-3-phosphocholine (DOPC; molar ratio of 
KR2:DOPC = 1:20). The protein was placed on the surface of a diamond 
atrFTIR crystal (nine internal total reflections), and was dried in a gentle 
stream of N₂. The sample was then rehydrated with the first solvent, 
containing 50 mM NaCl in 5 mM Tris-HCl (pH 8.0), at a flow rate of 
0.5 ml min⁻¹. The atrFTIR spectrum was recorded at 2 cm⁻¹ resolution with 
an FTIR spectrometer (Agilent, California, USA) equipped with a 
liquid-nitrogen-cooled MCT detector. After exchange to the second solvent 
(50 mM KCl in 5 mM Tris-HCl, pH 8.0) and recording the FTIR spectrum, the 
difference FTIR spectrum was calculated by subtraction of the data obtained 
for the first and second solvents. 

Flash photolysis. The transient absorption spectra of KR2 mutants were obtained 
by a flash photolysis system, using a multichannel detector (Hamamatsu 
Photonics, Japan). The purified sample was reconstituted into DOPC (molar 
ratio of KR2:DOPC = 1:20) and was suspended in 50 mM Tris-HCl buffer 
(pH 8.0), containing 100 mM NaCl. The sample solution was placed in a quartz 
cuvette and excited with a beam of the second harmonic of a nanosecond-pulsed 
Nd³⁺-YAG laser (λ = 532 nm, INDI40, Spectra-Physics). The laser power was 
3 mJ per pulse, and the repetition rate was 0.5 Hz, which is sufficiently slower 
than the photocycle rate of KR2 to avoid the excitation of transient 
photo-intermediates. The transient absorption spectra of KR2 were obtained by 
calculating the ratio of the transmitted probe light from a Xe arc lamp (L8004, 
Hamamatsu Photonics, Japan), with and without laser excitation. The time 
evolution of the absorption change at a specific probe wavelength was obtained 
by combining the data points at different time delays (t_delay = 80 μs-200 ms). 
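
The ratio calculation described above corresponds to the usual definition of a transient absorbance change. The following is a minimal sketch, assuming ΔA = −log10(I_with / I_without), where the two intensities are the transmitted probe light with and without laser excitation. The function name and the intensity values are hypothetical and serve only to illustrate the arithmetic.

```python
import math


def delta_absorbance(i_with_pump: float, i_without_pump: float) -> float:
    """Transient absorbance change from transmitted probe intensities."""
    if i_with_pump <= 0 or i_without_pump <= 0:
        raise ValueError("intensities must be positive")
    return -math.log10(i_with_pump / i_without_pump)


# Hypothetical probe intensities (arbitrary units) at three delay times
# spanning the 80 us to 200 ms window mentioned in the text.
delays_ms = [0.08, 1.0, 200.0]
with_pump = [0.90, 0.95, 1.00]
without_pump = [1.00, 1.00, 1.00]
trace = [delta_absorbance(a, b) for a, b in zip(with_pump, without_pump)]

# A 0.5 Hz repetition rate leaves 2 s between successive laser pulses.
pulse_interval_s = 1.0 / 0.5
```

Collecting such values at one probe wavelength over many delays gives the time trace of the absorption change; repeating over wavelengths gives the transient spectra.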

Quantification of KR2 expressed in E. coli. E. coli cells expressing KR2 were 
centrifuged and suspended in 50 mM Tris-HCl buffer (pH 8.0), containing 
100 mM NaCl, to a final volume of 3 ml. Then, 200 μl of 1 mM lysozyme was 
added. The E. coli cells were disrupted by ultrasonication, and KR2 was 
solubilized in 3.0% DDM. The absorption change, representing the bleaching of 
KR2 by hydroxylamine, was measured with a UV-vis spectrometer (UV-2400PC, 
Shimadzu) with an integrating sphere (ISV-722, Jasco, Japan) for 0.5-1.5 h, after 
the addition of hydroxylamine to a final concentration of 500 mM. Illumination 
was provided by the output from a 1-kW tungsten-halogen projector lamp 
(Master HILUX-HR, Rikagaku, Japan) through a glass filter (Y-52, AGC 
Techno Glass, Japan) at wavelengths >500 nm. The molecular extinction 
coefficient of rhodopsin (ε) was calculated from the ratio between the 
absorbances of rhodopsin and retinal oxime (ε = 33,900 M⁻¹ cm⁻¹), and the 
amount of KR2 expressed in E. coli cells was determined from the absorbance of 
the bleached rhodopsin and the ε value. 
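
The quantification described above can be sketched with the Beer-Lambert law. Because bleaching one rhodopsin molecule releases one retinal oxime, the two species have equal concentrations, so the ratio of their absorbance changes equals the ratio of their extinction coefficients. The absorbance values, the 1 cm path length, the sample volume and the function names below are hypothetical, chosen only to show the calculation.

```python
EPSILON_OXIME = 33_900.0  # M^-1 cm^-1, retinal oxime (value quoted in the text)


def rhodopsin_epsilon(delta_a_rhodopsin: float, delta_a_oxime: float) -> float:
    """Extinction coefficient of the rhodopsin from the absorbance ratio."""
    return EPSILON_OXIME * delta_a_rhodopsin / delta_a_oxime


def rhodopsin_amount_nmol(delta_a_rhodopsin: float, epsilon: float,
                          volume_ml: float, path_cm: float = 1.0) -> float:
    """Amount of pigment in nmol, using A = epsilon * c * l."""
    conc_molar = delta_a_rhodopsin / (epsilon * path_cm)
    return conc_molar * (volume_ml * 1e-3) * 1e9


# Hypothetical absorbance changes read from a bleaching difference spectrum.
eps = rhodopsin_epsilon(delta_a_rhodopsin=0.10, delta_a_oxime=0.08)
amount = rhodopsin_amount_nmol(0.10, eps, volume_ml=3.2)
```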

Measurement of thermostability. The thermostabilities of the KR2 mutants 
were estimated from the rates of colour bleaching of the pigment at 70 °C. A 
buffer containing 100 mM NaCl (or RbCl for wild-type KR2), 50 mM Tris-HCl 
(pH 8.0), and 0.1% DDM was used for the measurement. The concentration of the 
KR2 mutants was adjusted to ODs3,=0.5, and the solution (0.8 ml) was 
immersed in a heat bath maintained at 70 °C for 0.5-128 min. After heating, 
the solution was rapidly cooled on ice, and the protein aggregates were removed 
by centrifugation (10,000g, 3 min). The absorption spectrum of the supernatant 
was measured with a UV-vis spectrometer (UV-2400PC, Shimadzu), and the 
fraction of bleached protein was estimated from the decrease of the absorption 
at λmax. 
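
The legend of Extended Data Fig. 5g states that the decay time constants were estimated by least squares fitting with an exponential function. One simple way to do this is sketched below, assuming a single-exponential decay of the unbleached fraction, f(t) = f0·exp(−t/τ), and fitting a straight line to ln f versus t. This log-linear fit is a simplification chosen for illustration (the fitting procedure actually used is not specified beyond "least squares"), and the data are synthetic.

```python
import math


def fit_decay_time_constant(times_min, fractions_remaining):
    """Estimate tau in f(t) = f0 * exp(-t / tau) by linear least squares on ln f."""
    xs = list(times_min)
    ys = [math.log(f) for f in fractions_remaining]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx  # slope of ln f versus t equals -1 / tau
    return -1.0 / slope


# Synthetic, noise-free data generated with tau = 27.4 min (demonstration only).
times = [0.5, 2, 8, 32, 128]
fractions = [math.exp(-t / 27.4) for t in times]
tau = fit_decay_time_constant(times, fractions)
```

With real, noisy absorbance data a nonlinear fit on the untransformed values would weight the points differently, so the two approaches need not give identical time constants.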

Measurement of absorption spectra. Absorption spectra of KR2 mutants recon- 
stituted into DOPC were recorded with a UV-vis spectrometer equipped with an 
integrating sphere. The samples were suspended in buffers containing 150 mM 
NaCl and 10 mM citric acid monohydrate, MES, MOPS, HEPES, CHES or CAPS. 
The pH was adjusted to pH 4 or pH 7 by the addition of NaOH or HCl. 

Molecular cloning of mammalian expression construct. The gene encoding 
human-codon-optimized KR2 (hKR2) was commercially synthesized (GenScript, 
Japan) and subcloned in-frame into the EcoRI-BamHI sites of ptdTomato-N1. To 
prepare the 3.0 version of hKR2, the tdTomato was replaced with the eYFP carrying 
the membrane trafficking and ER export signals derived from the 
pAAV-EF1a-DIO-eNpHR3.0-eYFP vector (a gift from K. Deisseroth, Stanford 
University). The expression construct driven by the CMV promoter was named 
phKR2-3.0-eYFP. For optimization of hKR2 expression in neurons, the CMV 
enhancer/promoter of phKR2-3.0-eYFP was removed by digestion with AseI and 
NheI, and a PCR-amplified CaMKIIα promoter was inserted. A PCR-amplified 
woodchuck hepatitis virus posttranscriptional regulatory element (WPRE) was 
additionally inserted in the NotI site of the plasmid, by In-Fusion cloning 
(Takara Bio). The pLenti-CaMKII-hChR2-eYFP-WPRE plasmid (a gift from 
K. Deisseroth) was used as a PCR template for the preparation of the CaMKIIα 
promoter and WPRE. The expression construct driven by the CaMKIIα promoter 
was named pCaMKIIa-hKR2-3.0-eYFP-WPRE. All constructs were verified by 
sequencing. 

Mammalian cell culture. Cortical neurons were isolated from embryonic day 16 
Wistar rats (Japan SLC Inc., Shizuoka, Japan), using Nerve-Cells Dispersion 
Solutions (Sumitomo Bakelite, Tokyo, Japan) according to the manufacturer’s 
instructions, and were grown in culture medium (Sumitomo Bakelite, Tokyo, 
Japan) under a 5% CO₂ atmosphere at 37 °C. The expression plasmids were 
transiently transfected into cortical neurons by calcium phosphate transfection 
at days in vitro (DIV) 5 or 6. Electrophysiological recordings were then conducted 
at DIV 21-22 with neurons identified as expressing eYFP fluorescence by a con- 
ventional epifluorescence system. 

Electrophysiology in rat cortical neurons. All experiments were performed at 
room temperature (23 ± 2 °C). Photocurrents were recorded as previously 
described using an EPC-8 amplifier (HEKA Electronic, Lambrecht, Germany) 
under a whole-cell patch-clamp configuration. The data were filtered at 1 kHz, 
sampled at 10 kHz (Digidata 1440 A/D, Molecular Devices Co., Sunnyvale, 
California, USA), and stored in a computer (pCLAMP10.3, Molecular Devices). 
The internal pipette solution for whole-cell voltage- or current-clamp 
recordings from cortical neurons contained 125 mM K-gluconate, 10 mM NaCl, 
0.2 mM EGTA, 10 mM HEPES, 1 mM MgCl₂, 3 mM MgATP, 0.3 mM Na₂GTP, 10 mM 
Na-phosphocreatine, and 0.1 mM leupeptin, adjusted to pH 7.4 with KOH. The 
extracellular artificial cerebrospinal fluid solution contained 125 mM NaCl, 
2.5 mM KCl, 25 mM NaHCO₃, 1.25 mM NaH₂PO₄, 2 mM CaCl₂, 1 mM MgCl₂, 
and 11 mM glucose, and was bubbled with a gas mixture containing 95% O₂ and 
5% CO₂. In all cortical neuron experiments, artificial cerebrospinal fluid 
contained 20 μM 6,7-dinitroquinoxaline-2,3-dione (DNQX, Tocris Bioscience, 
Ellisville, Missouri, USA), 25 μM D-(−)-2-amino-5-phosphonovaleric acid 
(D-AP5, Tocris), and 100 μM picrotoxin (Nacalai Tesque, Kyoto, Japan) to block 
all synaptic inputs. The directly measured liquid junction potential was 11.4 mV 
and was compensated for. 
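
The solution compositions above fix the ionic gradients across the membrane, which can be made concrete with the Nernst equation. The sketch below estimates the K⁺ equilibrium potential from the stated concentrations and shows one common way of applying a liquid junction potential correction. The sign convention of the correction (subtracting the junction potential from the command potential) and the function names are assumptions, since the text only states that the potential was compensated for.

```python
import math

R = 8.314462618   # gas constant, J mol^-1 K^-1
F = 96485.33212   # Faraday constant, C mol^-1


def nernst_mv(conc_out_mm: float, conc_in_mm: float,
              valence: int = 1, temp_c: float = 23.0) -> float:
    """Equilibrium potential in mV (inside relative to outside)."""
    temp_k = temp_c + 273.15
    return 1e3 * (R * temp_k) / (valence * F) * math.log(conc_out_mm / conc_in_mm)


def corrected_potential_mv(command_mv: float, ljp_mv: float = 11.4) -> float:
    """Membrane potential after subtracting the liquid junction potential.

    The sign convention used here is an assumption.
    """
    return command_mv - ljp_mv


# K+: 2.5 mM in the extracellular solution and 125 mM K-gluconate in the
# pipette (the KOH added for pH adjustment is ignored in this estimate).
e_k = nernst_mv(2.5, 125.0)
```

At 23 °C this gives a K⁺ equilibrium potential of roughly −100 mV, far more negative than typical resting potentials, which is the gradient against which an outward cation pump must still operate.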

Optics used in electrophysiological experiments. Irradiation was performed 
using a SpectraX light engine (peak, 530-600 nm, Lumencor Inc., Beaverton, 
Oregon, USA) controlled by computer software (pCLAMP10.3, Molecular 
Devices). The power of the light was directly measured under microscopy by a 
visible light-sensing thermopile (MIR-100Q, Mitsubishi Oil Chemicals, Tokyo, 
Japan) and was 5.4 mW per mm². 

Confocal images of cortical neurons. Cortical neurons prepared for electro- 
physiological recordings (DIV 22 or 24) were imaged three-dimensionally with 
a z-axis interval of 0.48 μm, using a conventional confocal laser-scanning 
microscope (LSM510META, Carl Zeiss) equipped with a 40× objective. 

Generation of C. elegans transgenic lines. K. eikastus KR2 tagged with the 
fluorescent protein Venus (KR2-Venus) was placed under the pan-neuronal 
H20 promoter by the Gateway recombination system, in which the promoter 
on an entry vector and the open reading frame on a destination vector were 
combined by an LR reaction. All C. elegans transgenic lines for optical 
stimulation experiments carried the lite-1(xu7) mutation, to reduce the 
intrinsic phototaxis response. The expression construct pH20p::KR2-venus 
(20 ng μl⁻¹) was introduced into the animals using standard transgenic 
techniques with the transformation marker punc-122p::venus (10 ng μl⁻¹), 
generating lite-1(xu7); 
peEx1615[H20p::KR2::venus unc-122p::venus] (JN1615). Animals that harboured 
only the transgenic marker, lite-1(xu7); peEx1627[unc-122p::venus] (JN1627), 
were used as a control. 

Behavioural assay in C. elegans. The forward locomotion speed of the animals 
was measured using a worm-tracking system. C. elegans animals were grown to 
young adults on standard media and further cultivated overnight in the 
presence or absence of 5 μM ATR. For each trial, roughly 30 transgenic animals 
were washed from the culture plate and transferred to an ATR-free test plate 
(25 mM potassium phosphate (pH 6), 1 mM CaCl₂, 1 mM MgSO₄, 50 mM NaCl, 
2% agar). After 3 min acclimatization on the test plate, a 30 s pulse of green 
light (peak wavelength = 525 nm; 0.15 mW per mm²) was delivered by a 
ring-shaped light-emitting diode (CCS Inc.). The average locomotion speed of 
all animals on a test 
plate was calculated for each second. 
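
The per-second averaging described above can be sketched as follows. The data layout (one list of speeds per animal, with `None` where an animal was not tracked in a given second), the function name and the example values are hypothetical; the output format of the worm-tracking system is not described in the text.

```python
def mean_speed_per_second(tracks):
    """Average speed across animals for each one-second bin.

    tracks: list of per-animal speed lists, one value per second;
    None marks seconds in which an animal was not tracked.
    """
    n_seconds = max(len(t) for t in tracks)
    means = []
    for s in range(n_seconds):
        values = [t[s] for t in tracks if s < len(t) and t[s] is not None]
        means.append(sum(values) / len(values) if values else None)
    return means


# Hypothetical speeds (mm per s) for three animals over four seconds.
tracks = [
    [0.20, 0.18, 0.10, 0.08],
    [0.22, None, 0.12, 0.10],
    [0.18, 0.20, 0.08],
]
means = mean_speed_per_second(tracks)
```

Each plate then contributes one speed trace, and traces from repeated trials can be combined to give the mean and s.e.m. values plotted against time.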

Statistical methods. No statistical methods were used to predetermine sample 
size. 
Extended Data Figure 1 | Ion transport mechanisms of light-driven proton 
pumps and chloride pumps. a, The proton transport mechanism for 
light-driven proton pumping in BR. The Schiff base proton works as the 
substrate, and avoids the electrostatic repulsion between itself and the 
substrate. b, The chloride transport mechanism for light-driven chloride 
pumping in HR. The numbering corresponds to the residue numbers in HR from 
Halobacterium salinarum. c, The electrostatic repulsion problem has to be 
solved by the light-driven cation pumps, rather than the proton pumps. The 
substrate cation (A⁺) must avoid the electrostatic repulsion with the Schiff 
base proton. 
a
                          Acidic                  Neutral
Data collection
  Space group             I222                    I222
  Cell dimensions
    a, b, c (Å)           40.6, 81.6, 233.8       40.6, 83.5, 234.6
    α, β, γ (°)           90, 90, 90              90, 90, 90
  Resolution (Å)          50.0-2.3 (2.48-2.32)    50.0-2.3 (2.49-2.33)
  Rsym                    0.114 (0.514)           0.159 (0.758)
  I/σI                    10.5 (2.7)              11.4 (2.8)
  Completeness (%)        98.0 (99.9)             98.1 (100.0)
  Redundancy              4.8 (4.9)               7.2 (7.4)
  CC1/2                   0.997 (0.906)           0.997 (0.834)
Refinement
  Resolution (Å)          40.6-2.3 (2.37-2.30)    40.9-2.3 (2.44-2.30)
  No. reflections         17,533 (1,588)          17,988 (2,974)
  Rwork/Rfree             0.2072/0.2486           0.1949/0.2308
  No. atoms
    Protein               2,142                   2,164
    Ligand (ATR)          20                      20
    Lipid                 30                      32
    Water                 af                      67
    Others (PEG)          7                       0
  B-factors
    Protein               40.5                    36.1
    Ligand (ATR)          33.2                    31.1
    Lipid                 49.2                    45.5
    Water                 43.7                    40.3
    Others (PEG)          54.8                    -
  R.m.s. deviations
    Bond lengths (Å)      0.008                   0.008
    Bond angles (°)       1.005                   1.023

*Highest resolution shell is shown in parentheses. 


Extended Data Figure 2 | Crystallography. a, Table describing data collection 
and refinement statistics of KR2 in acidic and neutral conditions. b, The crystal 
packing of KR2, viewed parallel to the membrane. c, Electron-density map for 
the retinal binding pocket of KR2. A stereo view of the 2Fo − Fc maps (blue 
mesh, contoured at 1.5σ) for the retinal binding pockets of KR2 under the acidic 
conditions is shown. Water molecules are depicted by red spheres. 
Extended Data Figure 3 | Structure-based sequence alignment of microbial 
rhodopsins. The sequences are KR2, putative Na⁺-pump rhodopsin (NaR) 
from Dokdonia sp. PRO95 (DsNaR, GenBank ID: AEX55013.1), putative NaR 
from Nonlabens dokdonensis DSW6 (NdNaR, GenBank ID: AGC76155.1), 
putative NaR from Truepera radiovictrix (TrNaR, NCBI reference sequence: 
YP_003706581.1), putative NaR from Gillisia limnaea (GlNaR, NCBI reference 
sequence: WP_006989277.1), putative NaR from Indibacter alkaliphilus 
(IaNaR, NCBI reference sequence: WP_009036080.1), xanthorhodopsin from 
Salinibacter ruber (Xanthorhodopsin, PDB ID: 3DDL), blue proteorhodopsin 
from Med12 (Blue_Proteorhodopsin, PDB ID: 4JQ6), bacteriorhodopsin 
from Halobacterium salinarum (Bacteriorhodopsin, PDB ID: 1IW6), 
halorhodopsin from Halobacterium salinarum (Halorhodopsin, PDB ID: 
1E12), sensory rhodopsin II from Natronomonas pharaonis 
(Sensory_rhodopsinII, PDB ID: 3QAP), and the chimaeric channelrhodopsin 
between ChR1 from Chlamydomonas reinhardtii and ChR2 from 
Chlamydomonas reinhardtii (C1C2, PDB ID: 3UG9). The sequence 
alignment between KR2, xanthorhodopsin, blue proteorhodopsin, 
bacteriorhodopsin, halorhodopsin, sensory rhodopsin II and C1C2 was created 
using the T-Coffee server. Secondary structure elements for KR2 are shown as 
coils and arrows. TT represents turns. Identical and conservatively substituted 
residues are highlighted in red. The N-helix residues and the residues tethering 
the N-helix to the protein core (Glu160 and Arg243 in KR2) are coloured green. 
Asn112 and Asp116 in the NDQ motif, and the Ser residue interacting with 
Asp116 (Ser70 in KR2) are coloured blue. Carboxylates on ECL1 are coloured 
yellow. The glutamine (Gln123 in KR2) that contributes to the putative 
intracellular Na⁺ binding site is coloured purple. The glycine and tyrosine 
residues that form the hydrophobic hole near the β-ionone ring of ATR are 
coloured orange. The asterisk under the alignment indicates the lysine that 
forms the Schiff base with ATR. 
Extended Data Figure 4 | Structural comparisons between KR2, BR and XR. 
a, b, Side views (left) and extracellular views (right) of the superimposed 
structures of KR2 and BR (a) or KR2 and XR (b). The ATRs are shown as stick 
models. c, d, Structural comparison of the Schiff base orientations between KR2 
and BR. c, Stereo view of the KR2 structure and its electron density in the Schiff 
base region. The 2Fo − Fc map (blue mesh, contoured at 2.0σ) is shown. 
d, Stereo view of the BR structure in the same region. The water molecule is 
depicted by a red sphere, and the hydrogen bond between the Schiff base and 
the water molecule is represented by the black dashed line. 
Extended Data Figure 5 | Functional characterization of wild-type KR2 and 
its mutants. a, Na⁺ binding to wild-type (WT) KR2 and three ECL1 mutants. 
atrFTIR difference spectra of wild-type KR2 and three ECL1 mutants 
(E90Q/E91Q, D98N, and D102N) upon the exchange of 50 mM NaCl/KCl. Dotted 
lines are the duplicated difference spectra of wild type. b-d, Photocycle of the 
KR2 D102N mutant. b, c, Transient absorption spectra of the KR2 D102N 
mutant (b) and time traces of the absorption changes (c) at specific probe 
wavelengths. d, Photocycle scheme for the Na⁺ pump, determined from the 
analysis of the results shown in b and c. The lifetime of O-decay of wild-type 
KR2 is only shown for the major component (90.6% of total O-accumulation), 
and the previously reported minor component (9.4%) with τ = 112 ms is not 
discussed here. Values are means and s.d. calculated from the traces shown in 
c. e, Expression levels of wild-type KR2 and its mutants. Blue bars indicate the 
expression levels of wild-type KR2 and its nine mutants (mean ± s.d., n = 3). 
The expression of the Δ1-18 mutant was too low to be determined. f, Left, 
magnified view of the interactions between the N-helix and TM4, TM7 and 
ECL1. Right, stereo view illustrating the interactions between the N-helix and 
TM4, TM7 and ECL1. Hydrogen bonds are shown by dashed lines and water 
molecules are represented by red spheres; the residue on the β-sheet is 
represented as a green stick model. g, Time constants of the decay of 
detergent-solubilized wild-type KR2 and mutant proteins in thermostability 
assays, shown as means ± s.d. estimated by the least squares fitting technique 
with an exponential function. All mutant proteins were assayed in buffer 
containing 100 mM NaCl (or RbCl for wild-type KR2), 50 mM Tris-HCl 
(pH 8.0), and 0.1% DDM. 

Sample names       Time constant of the decay / min
KR2 WT (Na⁺)       150 ± 4
KR2 WT (Rb⁺)       27.4 ± 0.5
E11A               1740.1
D102N              39 ± 3
E160A              0.55 ± 0.06
R243A              0.31 ± 0.08
Extended Data Figure 6 | The electron densities around Asp116 under 
neutral conditions. a-c, Stereo views of the structures and densities in the 
Schiff base region. The 2Fo − Fc maps (blue mesh, contoured at 1.0σ) and 
Fo − Fc maps (green and red meshes, contoured at 3.0σ and −3.0σ, respectively) 
are shown. a, Structural refinement with Asp116 rotamer 1 resulted in a strong 
negative peak, suggesting its alternative conformation. b, The refinement with 
Asp116 omitted also suggested the presence of a mixture of Asp116 with 
different conformations. c, Asp116 with both rotamers 1 and 2 fits well in the 
electron density. 
Extended Data Figure 7 | Electrophysiological and behavioural analyses. 
a, Membrane expression and localization of KR2 in mammalian cortical 
neurons (left, scale bar 20 μm) and C. elegans (right, scale bar 100 μm). 
b, KR2-dependent stable hyperpolarization. A current pulse was injected into a 
KR2-expressing neuron to evoke repetitive action potentials. The membrane 
potential was hyperpolarized for 10 s (top), 30 s (middle) and 60 s (bottom) 
when the neuron was illuminated by green light (5.4 mW per mm², green bar). 
The resting potential was −74 mV throughout the experiment. c, The recovery 
of the locomotion speed in KR2-expressing C. elegans after cessation of 
illumination with green light. The green bar represents the light illumination. 
Extended Data Figure 8 | The retinal binding pockets and their surfaces in 
BR, XR and KR2. a, Magnified views of the retinal binding pockets in BR (left 
panel), XR (middle panel) and KR2 (right panel). The residue that occludes the 
hydrophobic pore is indicated by the red dashed circle. b, Surface 
representations of the pocket, from the same viewpoint as in a. The blue and red 
dashed circles indicate the hydrophobic pores of XR and KR2, respectively. 
c, Surface representations of the pocket, from the same viewpoint as in a and 
b, showing the (putative) interaction between the second chromophore and the 
protein in the crystal structure of XR with salinixanthin (PDB ID: 3DDL) (left 
panel), and the putative binding site of the second chromophore, carotenoid 
(right panel). 
Extended Data Figure 9 | Putative water molecule between Asn112, Trp113 
and Asp251. Stereo view of the structures and densities in the Schiff base 
region, including the putative water molecule between Asn112, Trp113 and 
Asp251. The 2Fo − Fc maps (blue mesh, contoured at 1.5σ) are shown. The 
black dashed lines and the numbers are the possible hydrogen bonds and their 
distances (in Å) between the putative water molecule and the neighbouring 
residues. 
Extended Data Figure 10 | Calculated absorption spectra of wild-type KR2 
in the O intermediate state. The O absorption spectra of wild-type KR2 in the 
solution containing NaCl (red) or LiCl (yellow), calculated from the previously 
reported spectra. The green dotted line represents the dark absorption 
spectrum of wild-type KR2 in the solution containing NaCl. The number above 
each spectrum is the peak wavelength (λmax). 
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An extremely young massive clump forming by 
gravitational collapse in a primordial galaxy 


A. Zanella, E. Daddi, E. Le Floc’h, F. Bournaud, R. Gobat, F. Valentino, V. Strazzullo, A. Cibinel, M. Onodera, V. Perret,
F. Renaud & C. Vignali


When cosmic star formation history reaches a peak (at about redshift 
z ~ 2), galaxies vigorously fed by cosmic reservoirs'” are dominated 
by gas** and contain massive star-forming clumps”®, which are 
thought to form by violent gravitational instabilities in highly tur- 
bulent gas-rich disks”*. However, a clump formation event has not 
yet been observed, and it is debated whether clumps can survive 
energetic feedback from young stars, and afterwards migrate 
inwards to form galaxy bulges’. Here we report the spatially 
resolved spectroscopy of a bright off-nuclear emission line region 
in a galaxy at z = 1.987. Although this region dominates star forma- 
tion in the galaxy disk, its stellar continuum remains undetected in 
deep imaging, revealing an extremely young (less than ten million 
years old) massive clump, forming through the gravitational collapse 
of more than one billion solar masses of gas. Gas consumption in this 
young clump is more than tenfold faster than in the host galaxy, 
displaying high star-formation efficiency during this phase, in agree- 
ment with our hydrodynamic simulations. The frequency of older 
clumps with similar masses’, coupled with our initial estimate of 
their formation rate (about 2.5 per billion years), supports long life- 
times (about 500 million years), favouring models in which clumps 
survive feedback and grow the bulges of present-day galaxies. 

The high spatial resolution and sensitivity of Hubble Space 
Telescope (HST) imaging and spectroscopy routinely allows us to 
resolve giant star-forming regions (clumps) inside galaxies at z ~ 2, 
at 3 Gyr after the Big Bang. Stellar population modelling has revealed a 
wide range of ages for clumps observed in the continuum, with an
average age of ~100 Myr. However, clump formation rates and lifetimes
remain poorly constrained. Continuum-based stellar
ages are probably underestimated because clumps lose stars and reaccrete
gas during their evolution, whereas very young ages (<30 Myr)
cannot be probed with continuum imaging alone. High equivalent 
width (EW) emission lines are required. 

We obtained 16 orbits of HST Wide Field Camera 3 (WFC3) G141 
slitless spectroscopy and imaging with the F140W, F105W and F606W 
filters targeting a galaxy cluster at z = 2 (ref. 19). The F606W band 
traces the star formation distribution in the ultraviolet rest frame, 
and the F140W probes the optical rest frame, reflecting the stellar 
mass distribution. Nebular [O III] λ5007 Å emission was detected
for 68 galaxies with stellar masses 9.5 < log(M/M⊙) < 11.5 and
redshifts 1.3 ≲ z_spec ≲ 2.3, with measurements or upper limits for
Hβ, [O II] λ3727 Å and Hα when available. From spatially resolved
emission line maps we discovered a galaxy at z = 1.987 with a remarkably
bright, off-nuclear emission line region (F_[O III] = (4.3 ± 0.2) ×
10⁻¹⁷ erg cm⁻² s⁻¹, observed, plus Hβ and [O II]; Methods), lacking
any obvious counterpart in broadband imaging (Fig. 1). The [O III]
emission is spatially unresolved (radius <500 pc) and located at the
apparent distance of 1.6 ± 0.3 kpc (offset significance 7.6σ; Methods)
from the nucleus (that is, the barycentre of the stellar continuum). The 


deprojected distance is constrained within 3.6 ≤ d ≤ 6.2 kpc, corresponding
to 1.3-2.2 times the galaxy half-light radius (Methods).
Subtracting a point-like emission leaves no significant residuals in
the [O III] map. The continuum reddening and mass-to-light ratio
(M/L) maps are flat over the galaxy, excluding the possibility that 
the feature is artificially induced by dust lanes or inhomogeneous 
attenuation (Extended Data Figs 1 and 2 and Methods). 

From emission line ratios we estimated a reddening E(B − V) ~ 0.3
and a gas-phase metallicity Z ~ 0.4 Z⊙ for this region, consistent with
the host galaxy. Robust upper limits on its stellar continuum were 
estimated with detailed simulations, leading to remarkably high lower 
limits on the emission-line EWs. Given these limits and the line lumin- 
osities, the emitting region cannot be powered by a massive black hole 
or by shock ionization from wind outflows. We similarly disfavour the 
hypothesis of a transient, because the line luminosities remain constant
over time, or an ex situ merging system, because its older underlying
stellar continuum would be detected. In addition, this galaxy is
classified as a disk (not a merger) from its Asymmetry and M20 parameters
(Methods). A young star-forming clump formed in situ is
therefore the most plausible interpretation. On the basis of stellar 
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Figure 1 | A massive, very young clump in a disk galaxy at z = 1.987. 

a-c, Emission line maps showing off-nuclear, unresolved, bright [O III] (a), Hβ
(b) and [O II] (c) emission, with significances of 18σ, 3σ and 3.5σ.
d-f, No counterpart is detected in the F140W (d), F105W (e) and F606W
(f) direct images. The flux contours of the [O III] map in a have been overplotted
on the direct images. The colour scales logarithmically with flux from the 
minimum (black) to maximum (white) level displayed (they are different for 
a-c and d-f). The black cross in each panel indicates the barycentre of the 
stellar optical rest-frame continuum; the white circle indicates the point-spread 
function full-width at half-maximum. 
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Figure 2 | Constraints on the clump’s age from reddening-corrected, rest- 
frame, emission-line EWs. Lower limits on the clump EW (black solid line) of 
[O III] (a), Hβ (b) and [O II] (c) and the ratio between the Hβ luminosity and the
continuum at 1,500 Å (d) compared with theoretical tracks. A Salpeter initial
mass function is assumed, and different star formation histories are compared:
red lines, single burst; blue lines, constant SFR; yellow lines, star formation
history predicted by simulations (Fig. 4). The effect of reddening (ΔE(B − V) =
+0.1) is indicated in each panel (red arrow). The age of the clump is
constrained to be <10 Myr. 


population synthesis modelling for galaxies with active star formation, 
the observed EWs require very young ages for the star formation event, 
with a firm upper limit of 10 Myr (Fig. 2). Thus, although the ubiquity 
of clumps in high-z galaxies has been known for a decade, we are 
witnessing here the formation of a star-forming clump in the early 
stage of its gravitational collapse. From the reddening-corrected 
line luminosities we estimate a clump star formation rate (SFR) of
(32 ± 6) M⊙ yr⁻¹, comparable to that of the rest of the host galaxy
disk (Methods). The F140W continuum non-detection translates into
a stellar mass limit M⋆ < 3 × 10⁸ M⊙. To infer the underlying gas
mass of the clump we considered the Jeans mass of the galaxy as a
plausible upper limit, because fragmentation at higher masses is
unlikely. This constrains the clump gas mass to M_gas ≲ 2.5 ×
10⁹ M⊙, assuming a maximal gas velocity dispersion σ_V ~ 80 km s⁻¹
(refs 21, 22; Methods).

This finding offers new insights into the physics of clump formation 
in gas-rich turbulent media at high z. Using the estimate of its underlying
gas mass, stellar mass and SFR we can constrain the nature of its
star formation mode. Its specific SFR (defined as SFR/M⋆) is more
than 30-fold that of its host galaxy, a typical Main Sequence galaxy
at z ~ 2. Similarly, the lower limit on the clump star formation efficiency
(SFE = SFR/M_gas) is more than tenfold that of normal galaxies
(Fig. 3), a behaviour that at galaxy-wide scales is observed only for 
extreme starbursts”’. At sub-galactic scales such a high SFE is observed 
for nearby molecular clouds”, which are small and transient features 
1,000-fold less massive than the present clump. Possibly at odds with 
what has been assumed so far'*’”"8, this provides observational evid- 
ence that giant clumps do not follow the Schmidt-Kennicutt law of 
normal star-forming galaxies, at least in the early stages of collapse. 
Instead, this luminous sub-galactic structure seems to follow the uni- 
versal star-formation law normalized by the dynamical time’. 
Comparing with the SFRs reported for older clumps with similar 
masses’, we estimated a SFR enhancement of ~3-5-fold at ‘peak 
formation’ with respect to later phases. This massive star-forming 
clump has a robust age estimate that is similar to or shorter than its 
dynamical time (Methods). 
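
The limits quoted above can be checked with a few lines of arithmetic. The sketch below is only a back-of-envelope illustration: it takes the clump SFR, the stellar mass upper limit and the gas mass upper limit from the text and converts them into lower limits on the specific SFR and SFE, plus the corresponding upper limit on the gas depletion time. The function and variable names are illustrative, not taken from the paper.

```python
# Back-of-envelope check of the clump's specific SFR and star formation
# efficiency, using only the values quoted in the text.
SFR_CLUMP = 32.0     # Msun/yr, clump star formation rate
MSTAR_MAX = 3.0e8    # Msun, upper limit on the clump stellar mass
MGAS_MAX = 2.5e9     # Msun, upper limit on the clump gas mass (Jeans mass)


def rate_per_gyr(sfr_msun_per_yr, mass_msun):
    """Return SFR / mass expressed in Gyr^-1."""
    return sfr_msun_per_yr / mass_msun * 1.0e9


# Mass upper limits translate into lower limits on sSFR and SFE.
ssfr_min_per_gyr = rate_per_gyr(SFR_CLUMP, MSTAR_MAX)
sfe_min_per_gyr = rate_per_gyr(SFR_CLUMP, MGAS_MAX)

# Gas depletion time (M_gas / SFR) is therefore an upper limit.
tdep_max_myr = MGAS_MAX / SFR_CLUMP / 1.0e6

print(f"sSFR > {ssfr_min_per_gyr:.0f} Gyr^-1")
print(f"SFE  > {sfe_min_per_gyr:.1f} Gyr^-1")
print(f"gas depletion time < {tdep_max_myr:.0f} Myr")
```

With these inputs the gas would be consumed in less than about 80 Myr if the current SFR were sustained.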
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Figure 3 | The Schmidt-Kennicutt plane. Comparison of the trends for 
starbursts and Main Sequence galaxies” (black solid lines and shaded region 
indicating the 0.2 dex dispersion of the Main Sequence) with the location of 
observed (Vyc1, red filled circle with error bars indicating s.d.) and simulated
clumps (coloured dots connected with lines; solid and dashed lines for ages ≤30
Myr and >30 Myr, respectively). The average locations of simulated clumps
(red, thick line) and host simulated galaxy (square with error bars indicating
s.d.) are shown. Sudden variations in the M_gas of clumps are probably due to the
accretion of gas-rich clouds or small clumps. 


Prompted by our observations, we investigated the properties of 
clumps in their formation phase by using high-resolution simulations’. 
We solved the dark matter, stellar and gas gravity and hydrodynamics 
at a resolution of 3.5 pc and gas cooling down to 100 K, and we
modelled the feedback processes from young stars onto the gas:
photo-ionization, radiation pressure and supernova explosions.
Figure 4 shows a typical, M⋆ ~ 3 × 10¹⁰ M⊙, z = 2 galaxy model with
giant clumps formed through violent disk instability. Their formation
sites are located 2.1-7.0 kpc from the nucleus of a galaxy that has a
half-mass radius of 4.5 kpc, consistent with many other simulations
and our observations. All clumps, and especially the youngest
ones, are brighter in the SFR map than in the continuum: they
undergo a burst of star formation during their initial collapse, with
peak SFRs of about (10-20) M⊙ yr⁻¹, consistent with our observations,
then evolve to a lower specific SFR regime within 20 Myr, once feedback 
has regulated star formation and their stellar mass has grown. Our 
simulations further corroborate the idea that all clumps behave like 
galactic miniatures of starbursts in the Schmidt-Kennicutt diagram 
during their first 20 Myr (Fig. 3). The SFE of simulated clumps 
decreases at later times, although it remains ≳0.5 dex higher than that
of Main Sequence galaxies, consistent with their shorter dynamical
times. The presence of massive clumps is probably an effective reason
for the observed rise of the SFE in normal Main Sequence galaxies from
z = 0 to z = 2 (refs 3, 4, 29), given the increasing prevalence of clumps at
high z. Furthermore, the violent burst-like behaviour that young 
clumps show at formation is consistent with simulations predicting 
that, because of their rapid collapse, giant clumps could form globular 
clusters by converting gas into stars faster than stars expel the gas””. 

The short visibility window at high EWs (< 10 Myr, independently 
of the star formation history) has probably prevented the detection 
of the formation phase of clumps before now. From this timing 
constraint and the single discovery in our survey, we attempted an 
estimate of the clump formation rate of 2.5 Gyr⁻¹ per galaxy (for
M_clump ≳ 2.5 × 10⁹ M⊙; Methods). Given the observation of one or
two clumps per galaxy with similar masses, this converts into a
lifetime of ~500 Myr (Methods). This is longer than expected in 
models of clump destruction by stellar feedback"*”’. Instead, it is rep- 
resentative of the timescale needed for giant clumps formed in galactic 
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Figure 4 | Numerical simulations of a high-redshift clumpy galaxy seen face- 
on. a, b, Maps of stellar mass (a) and SFR (b) at HST-like resolution. All clumps 
have elevated SFR compared with M⋆, but this property is extreme for clump A,
observed 12 Myr after its formation. c, Time evolution of the SFR (black solid
line) and M⋆ (black dashed line) for clump A and for all the clumps (red solid
line, average SFR; red dashed line, average M⋆); each star formation history is
arbitrarily shifted in time to align the SFR peaks. All clumps experience an 
internal burst of star formation before evolving into a long-lasting regulated 
regime within 20 Myr. Yellow shaded regions indicate s.d. uncertainties. 


disks to migrate inwards through dynamical friction and gravity tor- 
ques and to coalesce to grow the central galactic bulges”’”. 

Our study demonstrates the detectability of ultra-young clumps in 
deep surveys, indicating low formation rates and long lifetimes. This is 
crucial to an understanding of key issues of galaxy formation and 
evolution such as clump migration, bulge formation and the role of 
feedback. However, future observations of larger samples of forming 
clumps with direct measurements of the sizes, gas masses and velocity 
widths (and hence dynamical masses) of clumps are required for a 
complete understanding. This should be within the capabilities of the 
complete Atacama Large Millimeter Array and James Webb Space 
Telescope. We note that spectroscopic surveys targeting high-z gal- 
axies (for example Sinfoni integral field spectroscopy (SINS) and 3D- 
HST) have not yet reported the identification of giant clumps at forma- 
tion. This might suggest that they are rarer events than suggested by 
our survey, which allowed us to identify a direct signature of massive 
clump formation by means of gravitational collapse. 

Online Content Methods, along with any additional Extended Data display items 


and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Emission line maps. The 16 HST/WFC3 orbits of G141 slitless spectroscopy, 
taken along three position angles (~0°, −30° and +15°), were reduced with
aXe*!. Residual defects (such as bad pixels and cosmic ray hits) were removed with 
L.A.Cosmic”. Two-dimensional spectra were background subtracted with 
SExtractor*’, and the continuum emissions of the main target and surrounding 
sources (including higher and lower order dispersion spectra) were removed by 
fitting their aXe continuum models with free normalization (Fig. 1). 

Astrometrically calibrated emission line maps were obtained by cross-correlating
the spectral images of [O III] (the brightest line) with the three different
position angles. This is preferred over cross-correlating with the continuum image
because our target has different broadband and line morphologies. For this step,
the spectral images were combined with the IRAF task WDRIZZLE, weighting
each single orientation by its exposure time. The astrometry of the Hβ and [O II]
emission maps was tied to that of [O III]. The resulting redshift agrees accurately
with Subaru/MOIRCS longslit spectroscopy.

The [O III] doublet is resolved at the spectral resolution of our data for relatively
compact galaxies. We removed the [O III] λ4959 Å component by modelling the
combined emission line images with GALFIT using an effective point-spread
function (PSF) consisting of a main lobe for the 5007 Å line and three fainter ones.
Clump continuum emission. Visual inspection of the multi-band HST imaging 
did not reveal any evidence of the clump, and the evaluation of the isophotal 
contours of the galaxy images did not show disturbances at its location. Thus 
we searched for its presence by modelling the imaging with GALFIT (Extended 
Data Fig. 3). A single Sérsic profile provided a simplified fit, leaving strong
positive and negative residuals near the expected position of the clump. Such a
pattern is a systematic effect as a result of the presence of clumps at the outskirts of
the galaxy major axis, because they are not symmetrically located with respect to
the nucleus, resulting in an effective bending of the galaxy isophotes. Masking the
external regions and fitting the central part of the galaxy with a single Sérsic profile
left negligible residuals (<5%). As a further check, we fitted the direct images with
the Multi-Gaussian Expansion parameterization (MGE) algorithm, fitting average
azimuthal light profiles with ellipsoidal isophotes to the central part of the
galaxy. The residuals are negligible (<5%). Analogous residuals resulted from
the use of three spatially offset Sérsic profiles: one centred at the barycentre of the
stellar light (as determined by SExtractor from the F140W image) and two others,
an order of magnitude fainter, to the top left and bottom right. This is our best fit
(baseline) model for the galaxy continuum. 

This three-component fit is a technical solution adopted because of the irregular 
morphology of our target, typical of clumpy high-z disks, and should not mislead 
one to conclude that the galaxy is an ongoing merger. In this regard, we classified 
the galaxy as a disk based on the Asymmetry and M20 parameters measured on
stellar mass maps derived from pixel-to-pixel spectral energy distribution (SED) 
fitting?’ *' (Extended Data Fig. 4), a diagnostic calibrated with MIRAGE numerical 
simulations”. Finally, the F105W/F140W ratio (Extended Data Fig. 2) provides no 
evidence for a bulge. 

Limits on the clump continuum were obtained with simulations, injecting PSF 
components at approximately the same isophotal level as that of the expected 
clump position, and fitting them together with our baseline model. From these 
estimates we subtracted the contribution of emission lines ([O III] and Hβ for
F140W; [O II] for F105W), obtaining factors of 2 and 1.3 deeper flux upper limits
for F140W and F105W, respectively. Normalizing a series of Starburst99 stellar
population synthesis models with different stellar ages to the most constraining
(F105W) upper limit allowed us to refine the F140W limit, which is relevant for
calculating the [O III] and Hβ emission line EWs (Extended Data Fig. 5).
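
To make the link between a continuum upper limit and an EW lower limit concrete, here is a minimal sketch. It assumes the usual definition of the rest-frame equivalent width as the line flux divided by the observed continuum flux density and by (1 + z). The continuum value used in the example is a hypothetical placeholder, since the actual limits are not quoted in this text; only the [O III] line flux and the redshift come from the paper.

```python
def rest_frame_ew(line_flux, continuum_flux_density, redshift):
    """Rest-frame equivalent width in angstroms.

    line_flux: integrated line flux in erg s^-1 cm^-2
    continuum_flux_density: observed continuum in erg s^-1 cm^-2 A^-1
    redshift: source redshift (the observed EW is divided by 1 + z)
    """
    if continuum_flux_density <= 0:
        raise ValueError("continuum flux density must be positive")
    return line_flux / continuum_flux_density / (1.0 + redshift)


# [O III] flux and redshift from the text; the continuum upper limit below
# is a HYPOTHETICAL value chosen only to illustrate the calculation.
F_OIII = 4.3e-17            # erg s^-1 cm^-2
CONTINUUM_UPPER = 1.0e-20   # erg s^-1 cm^-2 A^-1 (assumed, not from the paper)
Z_CLUMP = 1.987

# An upper limit on the continuum gives a lower limit on the EW.
ew_lower_limit = rest_frame_ew(F_OIII, CONTINUUM_UPPER, Z_CLUMP)
print(f"EW([O III]) > {ew_lower_limit:.0f} A (rest frame)")
```

A fainter continuum limit pushes the EW lower limit up, which is why the refined F140W limit matters for the age constraint.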
Clump offset from the galaxy nucleus. The clump is offset from the galaxy 
centre: the observed distance between the point-like [O III] emission and the
barycentre of the galaxy is 1.6 kpc, with formally negligible measurement error. 
However, there are systematic uncertainties; these are related to the astrometric 
calibration of the direct imaging and slitless data and to the stability of the wave- 
length solution. We estimated the systematic uncertainties along the dispersion 
direction evaluating the distribution of differences between the measured and 
expected wavelengths of bright emission lines (Hα and [O III]) of the full survey
data. By comparing the position of galaxies in the direct imaging with that of the 
continuum emission in the grism data we evaluated the systematics in the cross-dispersion
direction. For each orientation of the grism we imposed

$$\chi^2_{\mathrm{red}} = \frac{1}{N_{\mathrm{dof}}} \sum_i \frac{(s_{\mathrm{meas},i} - s_{\mathrm{exp}})^2}{\sigma_p^2 + \sigma_a^2} = 1$$

where N_dof is the number of degrees of freedom, s_meas,i and s_exp are respectively the measured
and expected positions of the emission lines (or the continuum) of each galaxy,
and σ_p and σ_a indicate respectively the formal measurement errors on the emission-line
(or continuum) positions and the astrometric uncertainties. Average
systematic uncertainties are σ_a = 0.067″ along the dispersion direction and σ_a =
0.035″ along the cross-dispersion direction. We computed the uncertainties along
the right ascension and declination directions projecting along the orientation of 
each data set. Because the final, astrometrically calibrated, emission line maps are 
the weighted average of three different orientations, we estimated the total uncertainties
assuming that the errors (σ_i) in each orientation were independent:

$$\sigma_{\mathrm{tot}} = \frac{\sqrt{\sum_i t_i^2 \sigma_i^2}}{\sum_i t_i}$$

where t_i represents the exposure times. The clump offset is detected at 7.6σ and its
projected distance from the galaxy nucleus (defined as the barycentre of the stellar 
light) is 1.6 + 0.3 kpc. If we had chosen the light peak of the direct images as the 
nucleus, the offset would be comparable in magnitude and significance. We prefer 
the light barycentre definition because it coincides with the peak of the mass map 
(Extended Data Fig. 2). 
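
The two steps of this error budget can be sketched numerically. The code below is an illustration under stated assumptions, not the authors' pipeline: it assumes the systematic term is found by requiring a reduced chi-square of 1 when added in quadrature to the formal errors, and that the final uncertainty is that of an exposure-time-weighted mean of independent measurements. All input numbers in the example are made up.

```python
import math


def reduced_chi2(residuals, sigma_p, sigma_a, n_dof):
    """Reduced chi-square with a systematic term added in quadrature.

    Assumes every formal error in sigma_p is strictly positive.
    """
    total = sum(r * r / (sp * sp + sigma_a * sigma_a)
                for r, sp in zip(residuals, sigma_p))
    return total / n_dof


def solve_systematic(residuals, sigma_p, n_dof, upper=10.0, tol=1e-9):
    """Bisection for the sigma_a that gives a reduced chi-square of 1.

    Returns 0.0 if the formal errors alone already give chi2_red <= 1.
    `upper` must be large enough that chi2_red(upper) < 1.
    """
    if reduced_chi2(residuals, sigma_p, 0.0, n_dof) <= 1.0:
        return 0.0
    lower = 0.0
    while upper - lower > tol:
        mid = 0.5 * (lower + upper)
        if reduced_chi2(residuals, sigma_p, mid, n_dof) > 1.0:
            lower = mid   # still too much scatter: need a larger sigma_a
        else:
            upper = mid
    return 0.5 * (lower + upper)


def combined_uncertainty(sigmas, exposure_times):
    """Error on an exposure-time-weighted mean of independent measurements."""
    total_time = sum(exposure_times)
    variance_sum = sum((t * s) ** 2 for s, t in zip(sigmas, exposure_times))
    return math.sqrt(variance_sum) / total_time


# Made-up example: residuals of 0.1" with formal errors of 0.06".
sigma_a = solve_systematic([0.1, -0.1, 0.1, -0.1], [0.06] * 4, n_dof=4)
print(f"systematic term: {sigma_a:.3f} arcsec")

# Three orientations with equal errors and equal exposure times.
print(f"combined: {combined_uncertainty([0.3, 0.3, 0.3], [1.0, 1.0, 1.0]):.3f}")
```

For equal exposure times the combined error reduces to the familiar σ/√N.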

To determine the deprojected distance, the axial ratio of the galaxy and the angle 
θ between the galaxy major axis and the clump-nucleus direction are needed. We
estimated them from the range of solutions obtained by modelling the direct
images and the mass map with GALFIT and considering the outer isophotes of
the PSF-deconvolved galaxy. To further account for systematic effects we also
considered plausible uncertainties in the PSF derivation, and further estimates
based on the MGE software as an alternative to GALFIT. Given an axial ratio
0.21 ≤ q ≤ 0.35 (inclination i ~ 70-78°) and 48° ≤ θ ≤ 52°, we computed a
maximum plausible range for the deprojected distance of the clump from the
nucleus of 3.6 ≤ d ≤ 6.2 kpc, beyond the galaxy’s effective radius R_e = 2.8 ±
0.4 kpc (Extended Data Table 1). We did not account for the disk’s thickness: this
uncertain correction could imply a larger deprojected distance by 10-15% (for a 
typical thickness of a few hundred parsecs). 
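
The deprojection can be illustrated with the standard thin-disk geometry. This is a sketch under the assumption of an infinitely thin circular disk, for which cos i = q and the component of the offset along the minor axis is stretched by 1/q; the paper does not spell out its exact formula, so the function below is an assumption that happens to reproduce the quoted ranges closely.

```python
import math


def inclination_deg(axial_ratio):
    """Inclination of an infinitely thin circular disk: cos(i) = q."""
    return math.degrees(math.acos(axial_ratio))


def deprojected_distance(d_projected, theta_deg, axial_ratio):
    """Deproject an in-plane offset observed at angle theta from the major axis.

    The major-axis component is unchanged; the minor-axis component is
    foreshortened by q = cos(i) and is therefore divided by q.
    """
    theta = math.radians(theta_deg)
    along_major = math.cos(theta)
    along_minor = math.sin(theta) / axial_ratio
    return d_projected * math.hypot(along_major, along_minor)


D_PROJ = 1.6  # kpc, projected clump-nucleus distance from the text

d_min = deprojected_distance(D_PROJ, theta_deg=48.0, axial_ratio=0.35)
d_max = deprojected_distance(D_PROJ, theta_deg=52.0, axial_ratio=0.21)

print(f"inclination: {inclination_deg(0.35):.1f}-{inclination_deg(0.21):.1f} deg")
print(f"deprojected distance: {d_min:.1f}-{d_max:.1f} kpc")
```

The extremes of q and θ give roughly 3.6 and 6.1 kpc, close to the quoted 3.6-6.2 kpc range; the small difference at the upper end presumably reflects details of the authors' calculation that are not given here.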
Dust reddening. Estimating emission line luminosities and SFRs requires correc- 
tions for dust extinction (although this is less relevant for emission-line EWs, 
which are affected only by the differential line versus continuum reddening). 
We used stellar population modelling of the ultraviolet-to-near-infrared galaxy
SED, assuming the reddening law in ref. 45 and constant star formation histories
(SFHs) to measure the stellar continuum reddening. We converted this measure
into nebular reddening by using E(B − V)_nebular = E(B − V)_continuum/0.83 (ref. 46),
obtaining E(B − V)_nebular = 0.30. Independent estimates of the nebular reddening
were also obtained on the basis of emission line ratios: Hα/Hβ, assuming
case B recombination conditions; [O II]/Hα, assuming an intrinsic ratio of 1
(ref. 48), and [O II]/Hβ, with an intrinsic ratio estimated by following the
previous points. For these estimates we used Hα fluxes from MOIRCS, [O II]
from WFC3, and Hβ from the weighted average of MOIRCS and WFC3,
obtaining E(B − V)_Hα/Hβ = 0.24 ± 0.12, E(B − V)_[O II]/Hα = 0.32 ± 0.11, and
E(B − V)_[O II]/Hβ = 0.40 ± 0.25. The average of these estimates is nearly identical
to that from the stellar continuum. We therefore adopt E(B − V)_nebular = 0.30.
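
The consistency of the three line-ratio estimates with the adopted value can be verified directly. The sketch below computes both a simple mean and an inverse-variance weighted mean; the text only says "the average", so which of the two the authors used is an assumption left open here. The continuum reddening passed to the conversion function is a hypothetical input chosen to illustrate the 0.83 factor.

```python
import math


def nebular_from_continuum(ebv_continuum, factor=0.83):
    """Convert stellar continuum reddening to nebular reddening (ref. 46 factor)."""
    return ebv_continuum / factor


def simple_mean(values):
    return sum(values) / len(values)


def inverse_variance_mean(values, errors):
    """Inverse-variance weighted mean and its formal uncertainty."""
    weights = [1.0 / (e * e) for e in errors]
    mean = sum(w * v for w, v in zip(weights, values)) / sum(weights)
    return mean, 1.0 / math.sqrt(sum(weights))


# E(B-V) from Halpha/Hbeta, [O II]/Halpha and [O II]/Hbeta, as quoted.
ESTIMATES = [0.24, 0.32, 0.40]
ERRORS = [0.12, 0.11, 0.25]

plain = simple_mean(ESTIMATES)
weighted, weighted_err = inverse_variance_mean(ESTIMATES, ERRORS)

print(f"simple mean:   {plain:.2f}")
print(f"weighted mean: {weighted:.2f} +/- {weighted_err:.2f}")

# Hypothetical continuum reddening that would map onto E(B-V)_nebular = 0.30.
print(f"nebular from continuum 0.249: {nebular_from_continuum(0.249):.2f}")
```

Both averages land within a few hundredths of the adopted E(B − V) = 0.30.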

For the clump, a reddening estimate can be obtained by using WFC3, from the 
ratio of the [O II] and Hβ line fluxes. We derived a fairly noisy measurement
consistent with that of the whole galaxy (E(B − V)_[O II]/Hβ,clump = 0.24 ± 0.37).
Although formally this is also consistent with zero attenuation towards the clump,
it is unlikely because the galaxy is highly inclined. To improve the estimate of the
reddening affecting the clump, we attempted a derivation of the Hα flux of the
clump in the MOIRCS data, decomposing the two-dimensional spectrum with a
PSF-like component for the clump and a single Sérsic profile accounting for the
host galaxy disk, finding Hα = (7 ± 2) × 10⁻¹⁷ erg s⁻¹ cm⁻², ~50% of the
galaxy’s Hα emission. By averaging the reddening estimates from Hα/Hβ, Hα/[O II]
and [O II]/Hβ, we obtained E(B − V)_nebular,clump = 0.55 ± 0.20, consistent with
the reddening of the host galaxy. We thus assumed that the clump nebular reddening
was identical to that of the parent galaxy, which is consistent with the
literature. Extended Data Fig. 2 shows the observed F606W/F105W ratio, probing
the stellar continuum reddening, which is homogeneous over the galaxy. The
optical attenuation (A_V) at the clump position is similar to that at the galaxy
nucleus within 0.1-0.2 mag, and is close to the galaxy average. The position of
the galaxy nucleus (measured as the light barycentre, light peak or with GALFIT) is
stable and does not change with wavelength from F606W to F105W and F140W.
The [O III] and F105W continuum should be affected by a similar attenuation and
much less than the F606W continuum. Together with the flatness of the reddening 
map, this demonstrates that the clump emission lines were not an artefact due to 
modification of the galaxy nucleus position by reddening, because an even stron- 
ger effect would have been seen in F606W. Correcting the emission line maps and 
the imaging for reddening does not significantly alter the nucleus-clump distance. 
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Adopting the extinction law in ref. 50 (see ref. 51 for example) would produce 
reddening values < 15% higher, consistent within the uncertainties. 

Discarding the active galactic nucleus, shock, transient, and low-metallicity 
region hypotheses. The galaxy has three Chandra photons (one soft and two hard;
~2σ detection) in 146 ks data, giving L_2-10 keV ~ 2.9 × 10⁴² erg s⁻¹ (photon index
Γ = 1.8). This is tenfold the value expected from galaxy star formation. If an
active galactic nucleus (AGN) were present, it would produce an [O III] luminosity
~20-fold fainter than that of the clump. In Extended Data Fig. 6 both the
entire galaxy and the clump are located in the BPT diagram (we conservatively
use [N II]_clump < [N II]_galaxy). The emission line ratios are consistent with star-forming
galaxies at z ~ 2 (ref. 51). The [O III]/[N II] < 2.8 upper limit is also much
lower than typically observed in type 1 AGNs. The high EW further disfavours
the hypothesis of an off-nuclear AGN, because AGNs typically have EW_[O III] <
500 Å (ref. 56). Besides, no AGN signature was found from the galaxy’s SED, and
no excess possibly arising from nuclear accretion was detected in our deep 24 um 
Spitzer, Herschel and VLA data. 

The clump’s emission-line luminosity is comparable with that of the whole 
galaxy, so it cannot be due to shock from external outflows impacting the gas. 
The host SFR would generate ~30-fold weaker galaxy-integrated, shock-excited 
line luminosities”. The brightest shock-powered off-nuclear clouds in local infra- 
red luminous galaxies are >50-fold weaker**. Explicit calculations” for z = 2 
galaxies, using appropriate wind mass loads and velocities”, lead to analogous 
conclusions. The kinetic energy available in winds cannot account for the clump 
line luminosities. 

There is no evidence for substantial variability of the line luminosities over a ~3-year
timescale. HST/WFC3 G141 spectroscopy was obtained in June and July 2010, and
MOIRCS spectroscopy in April 2013 (Extended Data Table 2). Despite their lower
resolution (0.6″ seeing), MOIRCS spectra show the bright, compact [O III] and Hα
emissions from the clump, with a consistent flux. 

Low-mass (<10’Mo), very metal-poor galaxies (Z ~ 0.1Z.5) can display extre- 
mely high EW emission lines*’. Our target is substantially more massive and 
metal rich: using the [O III]/[O II] ratio we estimated Z ~ (0.4 ± 0.1) Z⊙ and
Z = (0.6 ± 0.2) Z⊙, for the clump and the galaxy, respectively (Extended Data
Fig. 4 and Extended Data Table 1). 

Constraining the age of the clump. We computed the time evolution of the Hβ
EW by using stellar population synthesis models, adopting Z = 0.4 Z⊙, a
Salpeter initial mass function and three different SFHs: an instantaneous burst,
constant star formation and a SFH obtained from our hydrodynamic simulations
(Fig. 3). All models show high EWs at young ages (log(EW) ≳ 2, independently
of the SFH), which decrease quickly for the instantaneous burst and more
smoothly in the other cases. We converted the Hβ EW into the expected [O III]
and [O II] EWs (Fig. 2), assuming the [O III]/Hβ ratio of z = 2 star-forming
galaxies and an Hα/[O II] luminosity ratio of 1 (ref. 48). Comparison of these
values with the EW lower limits yielded an age of <10 Myr for the clump (Fig. 2).

The directly measured continuum upper limits (rather than the more stringent
ones from synthetic spectra; Extended Data Fig. 5) give an age of ≲15 Myr.
Decreasing the adopted E(B − V) reddening by 0.1 dex would decrease the
line EW lower limits by only ~0.05 dex but would increase the L_Hβ/L_1500 limit
by ~0.1 dex, therefore hardly affecting the age constraints. Changing the metallicity
by 1.6 dex produces only a 0.2 dex age difference. Similarly, the age remains
unchanged by adopting, for example, a Kroupa or a Scalo initial mass function,
and a top-heavy one produces EWs 0.2 dex higher.
SFR estimate. The SFR of the whole galaxy was determined from the total Hα
luminosity from the MOIRCS spectroscopy, assuming the standard Kennicutt
conversion, resulting in (77 ± 9) M⊙ yr⁻¹, in agreement with that from SED
fitting (~85 M⊙ yr⁻¹ with an uncertainty of 0.2 dex; Extended Data Table 1).

The time-dependent conversion of line luminosity to SFR at young ages was
computed with Starburst99, adopting the SFH from our numerical simulations
(Extended Data Fig. 7). At t = 10 Myr this is 20% higher than from the Kennicutt
conversion. By averaging the estimates from Hβ, [O II] and Hα we obtained SFR =
(32 ± 6) M⊙ yr⁻¹ for the clump, where the error includes the uncertainties
associated with emission-line luminosities and reddening.
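
As an illustration of the luminosity-to-SFR step, the sketch below assumes that the "standard Kennicutt conversion" is the widely used Kennicutt (1998) Hα calibration for a Salpeter initial mass function, SFR = 7.9 × 10⁻⁴² L(Hα) with L in erg s⁻¹; the text does not state the coefficient, so this is an assumption. The optional factor of 1.2 mimics the 20% higher conversion quoted for t = 10 Myr.

```python
# Assumed calibration: Kennicutt (1998), Salpeter IMF.
# SFR [Msun/yr] = 7.9e-42 * L(Halpha) [erg/s]
KENNICUTT_HALPHA = 7.9e-42


def sfr_from_halpha(l_halpha_erg_s, young_boost=1.0):
    """SFR in Msun/yr from a dust-corrected Halpha luminosity.

    young_boost rescales the conversion for very young ages; the text quotes
    a 20% higher conversion at t = 10 Myr (young_boost = 1.2).
    """
    return KENNICUTT_HALPHA * l_halpha_erg_s * young_boost


def halpha_luminosity_for_sfr(sfr_msun_yr):
    """Invert the standard conversion (no young-age correction)."""
    return sfr_msun_yr / KENNICUTT_HALPHA


# Hα luminosity implied by the galaxy-integrated SFR of 77 Msun/yr.
l_galaxy = halpha_luminosity_for_sfr(77.0)
print(f"implied L(Halpha) = {l_galaxy:.2e} erg/s")

# Same luminosity of 1e42 erg/s with and without the young-age correction.
print(sfr_from_halpha(1.0e42), sfr_from_halpha(1.0e42, young_boost=1.2))
```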
Stellar mass estimate. Assuming the average mass-to-light ratio (M/L) of the host
galaxy (Extended Data Fig. 2), the flux upper limit on the continuum emission of
the clump implies M⋆ < 3 × 10⁸ M⊙. Using the M/L ratio from the clump SFH
(Extended Data Fig. 7) gives M⋆ < 2.1 × 10⁸ M⊙. Normalizing the simulations to
the observed Hβ luminosity yields M⋆ ~ 3.9 × 10⁸ M⊙, which is consistent with
the previous estimates given the uncertainties (a factor ~2, mainly due to the gas 
fraction of simulated galaxies and the details of feedback and stellar mass loss 
modelling at small scales; note that the simulated clumps seem on average to be 
slightly less massive than our observed one, but their observed physical properties 
and time behaviour are self-similar). 


Gas mass estimate. We inferred an upper limit to the gas mass in the clump from
the Jeans mass (M_J) of the galaxy, which is close to the maximum mass of gas that
can collapse in a rotating disk. Assuming a reasonable upper limit for the
typical gas velocity dispersion in high-z disk galaxies (σ_V < 80 km s⁻¹ (refs 21,
22)), we obtained M_gas ≲ M_J = 2.5 × 10⁹ M⊙. Using the M_gas/Hβ ratio from
simulations leads to a gas mass of M_gas ~ 2.7 × 10⁹ M⊙. Comparison with older
clumps from the literature, using our numerical simulations to relate the physical
properties at the ‘peak’ and later phases, yields

$$M_{\mathrm{gas}} = \mathrm{SFR}_{\mathrm{clump}} \times \frac{M_{\mathrm{gas,young,sim}}}{\mathrm{SFR}_{\mathrm{young,sim}}} \times \frac{M_{\star,\mathrm{old,lit}} / \mathrm{SFR}_{\mathrm{old,lit}}}{M_{\star,\mathrm{old,sim}} / \mathrm{SFR}_{\mathrm{old,sim}}}$$

where SFR_old,lit and M⋆,old,lit refer to older clumps reported in the literature; M⋆,old,sim
and SFR_old,sim are for old clumps in the simulations, and M_gas,young,sim and
SFR_young,sim are computed at t = 10 Myr, as for our young clump. This approach
leads to M_gas = 3 × 10⁹ M⊙ ± 0.2 dex, consistent with the independent estimates
discussed above. This agreement supports SFHs with an initial burst as predicted 
by our simulations. 

The Schmidt-Kennicutt relation can be used to provide alternative estimates,
on the basis of the clump SFR. The relation for Main Sequence galaxies would
imply that ~50% of the total gas in the galaxy is collapsing in an ultra-compact
region, which seems hardly believable, confirming that this young clump has a
higher SFE. Assuming instead the starburst-like relation, we obtained M_gas =
2 × 10⁹ M⊙ ± 0.38 dex, consistent with the previous estimates.

Considering an even younger age for the clump, as permitted by the upper limit
t < 10 Myr, would return higher absolute and specific SFRs (Extended Data Fig. 7;
that is, higher SFR/L_Hβ), confirming the starburst behaviour of the clump during
the formation phase.

Dynamical time estimate. By measuring the full-width at half-maximum
(FWHM) of the Hα line detected in the MOIRCS longslit spectroscopic data,
we determined a first upper limit on the gas velocity of the clump v_FWHM <
450 km s⁻¹ (the MOIRCS instrumental resolution). Given the upper limits on
the radius of the clump (R < 500 pc) and on its dynamical mass (M_dyn = M_gas +
M⋆ < 2.8 × 10⁹ M⊙), we then refined our estimate to v_FWHM < √(M_dyn G/R) ≈
200 km s⁻¹ (where G is the gravitational constant), consistent with clump
velocities typically observed in high-redshift galaxies. This leads to a dynamical
timescale t_dyn = 2πR/(v_FWHM/2) ≈ 29 Myr, in reasonable agreement with the
free-fall time of the clump t_ff ∝ √(R³/M_dyn) ≈ 17 Myr.

Clump formation rate and lifetime. The visibility window of the young phase 
can be defined as the time during which the EW is above a given threshold, as 
predicted by stellar population synthesis models. For our clump this ranges 
between ~5 Myr (instantaneous burst) and ~10 Myr for SFH from simulations. 
We used an average visibility window of 7 Myr. 

Knowing the visibility window and the observed number of ‘formation events’ 
per galaxy, the ‘clump formation rate’ can be estimated; comparing it with the 
average number of descendants observable per galaxy (virtually all old) yields the 
average lifetime of the clumps. 

We considered all galaxies in our survey with, first, M⋆ > 8.5 × 10⁹ M⊙ (mass
completeness, coinciding with the minimum mass that a galaxy should have to
host such a massive clump assuming a gas fraction of ~50%); second, M⋆ < 2 ×
10¹¹ M⊙ ([O III] emission becomes too weak at higher masses); and third, a
redshift 1.2 < z < 2.4 ([O III] emission lying inside the wavelength range of the
grism). In this way 57 galaxies were selected. With one ‘forming clump’ detected,
this corresponds to a ‘clump formation rate’ of 2.5 Gyr⁻¹ per galaxy.

We considered that in our survey we would have detected all formation events
of clumps with M_gas ≳ 2.5 × 10⁹ M⊙. Considering that almost all the initial gas
mass of a given clump is consumed at initial stages to form stars, the clump
stellar mass at late stages can be approximated by the gas mass at initial collapse,
independently of the age of the clump, as supported by our numerical
simulations. Typically there are about one or two clumps per galaxy above this mass
threshold, giving an average lifetime of 500 Myr.
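
The arithmetic behind the formation rate and lifetime can be checked with a short sketch. It assumes a simple steady-state balance (lifetime = number of clumps present per galaxy divided by the number formed per galaxy per unit time), which is our reading of the text rather than a stated formula; the function and constant names are illustrative only.

```python
# Back-of-the-envelope check of the clump formation rate and lifetime.
# Input numbers are those quoted in the text; the steady-state balance
# used in lifetime_myr() is an assumption made for this sketch.

N_GALAXIES = 57        # galaxies passing the selection
N_FORMING = 1          # forming clumps detected
VISIBILITY_MYR = 7.0   # average visibility window of the young phase


def formation_rate_per_gyr(n_forming, n_galaxies, visibility_myr):
    """Formation events per galaxy per Gyr."""
    return n_forming / (n_galaxies * visibility_myr / 1000.0)


def lifetime_myr(n_old_per_galaxy, rate_per_gyr):
    """Steady state: clumps present / clumps formed per unit time."""
    return 1000.0 * n_old_per_galaxy / rate_per_gyr


rate = formation_rate_per_gyr(N_FORMING, N_GALAXIES, VISIBILITY_MYR)
print(f"formation rate: {rate:.2f} per Gyr per galaxy")
for n_old in (1.0, 2.0):
    print(f"{n_old:.0f} old clump(s) per galaxy -> {lifetime_myr(n_old, rate):.0f} Myr")
```

With one to two old clumps per galaxy the sketch brackets the quoted 500 Myr (roughly 400 to 800 Myr).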

To compute the (large) associated uncertainty, we considered the Poisson error
associated with our single object discovery, the Poisson error for older clumps
from the literature, and the visibility window uncertainty. The asymmetric 1σ
uncertainties that we inferred are +0.74 dex and −0.55 dex. The lower envelope
of the 1σ range of the lifetime estimate is not far from the upper range of lifetimes
suggested by models in which clumps undergo strong feedback (50-100 Myr).
However, our estimate is probably a lower limit. The derived lifetime could be
affected by a ‘discovery bias’, because other high-redshift spectroscopic surveys
(such as SINS and 3D-HST) have not yet reported the observation of a similar
giant young clump. Furthermore, there are indications that our target galaxy is
living in a gas-enriched environment, which could also have anomalously
increased the SFR and thus the ‘clump formation rate’. This suggests that the
observation of a newly formed giant clump could be an even rarer event than
suggested by our data, and that the true average clump lifetime could be longer
than estimated here.

Code availability. The RAMSES code used to generate our simulations is available 
at http://www.ics.uzh.ch/~teyssier/ramses. 
Extended Data Figure 1 | GALFIT decomposition of the [O III] emission
line map. a, b, [O III] map (a) and model of the point source component (b) for
the clump. c, No strong residuals or artefacts are left after removal of the point
source component. The positions of the nucleus and of the clump are shown
as crosses.
Extended Data Figure 2 | Image ratios and mass map. a, b, Ratio of F105W/
F606W imaging in spectral flux density (F_ν) scale (a), a proxy for the dust
reddening of the stellar continuum, and F140W/F105W imaging, sensitive to
the M/L ratio (b). The positions of the nucleus and the clump are shown as
crosses. The maps show only small variations: the observed F_ν ratios for the
nucleus and clump positions are respectively 1.34 and 1.16 for F105W/F606W,
and respectively 1.39 and 1.25 for F140W/F105W. Galaxy-wide ratios are
respectively 1.27 and 1.37 for F105W/F606W and F140W/F105W. c, Mass map
in units of M⊙ per pixel.
Extended Data Figure 3 | Modelling of the galaxy light profile. F140W
direct image and [O III] emission line map (a, d, g, j), GALFIT models
(b, e, h, k) and residuals (c, f, i, l). The first row shows the single Sérsic profile
solution, the MGE model is in the second row, and our baseline model (the sum
of three Sérsic profiles, in which blue crosses mark the additional components)
is in the third row. The red cross indicates the barycentre of the stellar light and
the green cross marks the centre of the [O III] off-nuclear component.
Extended Data Figure 4 | The Asymmetry and M20 morphological
parameters as determined from the spatial distribution of the galaxy stellar
mass. Pink and light blue triangles represent disks and mergers from MIRAGE
numerical simulations, respectively. The galaxy presented in this work (red
filled circle with error bars indicating s.d.) is located in the typical region
occupied by disk galaxies; the vast majority of mergers have higher
Asymmetry and/or M20 parameters. The figure shows the same number of
mergers and disks even if mergers are expected to be a minority in optical
samples.
Extended Data Figure 5 | Upper limits to clump continuum flux. The
observed flux upper limits estimated from simulations and GALFIT modelling
in the three bands are shown as black filled circles. The black horizontal lines
indicate the bandpass width of each filter. Coloured curves represent reddened
Starburst99 stellar population synthesis models with different ages (from 8 to
20 Myr), normalized to the most stringent upper limit (F105W band). The
corresponding upper limits in F140W and F606W, obtained considering a
spectrum with an age ~10 Myr, are shown as grey filled circles.
Extended Data Figure 6 | Emission line diagnostics. a, BPT diagram
showing that the emission line ratios of the whole galaxy and of the clump (red
and light-blue points with error bars indicating s.d.) are consistent with being
powered by star formation. The [N II] upper limit and Hα emission of the whole
galaxy are measured from the Subaru/MOIRCS longslit spectroscopy follow-up,
and the [N II]/Hα upper limit for the clump is computed by assuming the
[N II] of the whole galaxy. b, Determination of the metallicities of the whole galaxy
and that of the clump from the [O III]/[O II] ratio.
Extended Data Figure 7 | Time evolution of physical quantities based on the
clump SFR(t) from our simulations. a, The peaks of all the curves normalized
to 1 to highlight the time delay occurring between the peak of the SFR and of the
luminosities L_Hβ, L_1,500 Å and L_5,000 Å. b, Peaks normalized to 1 at t = 1 Gyr to
stress the relative intensity of the observables at the peak and later phases. The
vertical black dotted line indicates the upper limit on the age of the clump (t =
10 Myr). The units of the plotted quantities are: SFR (M⊙ yr⁻¹), EW_Hβ (Å),
L_Hβ, L_1,500 Å and L_5,000 Å (erg s⁻¹), M⋆/L_5,000 Å (M⊙ erg⁻¹ s) and SFR/L_Hβ (M⊙
yr⁻¹ erg⁻¹ s).
Extended Data Table 1 | Properties of the galaxy and the clump

                                  Galaxy (ID568)    Clump (Vyc1)

Right ascension [h m s]           14:49:12.578      14:49:12.575
Declination [° ′ ″]               +8:56:19.42       +8:56:19.62
Re [kpc]                          2.8 ± 0.4*        < 0.5
SFR [M⊙/yr]                       77 ± 9            32 ± 6
log(M⋆/M⊙)                        10.3475           < 8.5
log(Mgas/M⊙)                      10.7 ± 0.2†       < 9.4
Z [Z⊙]                            0.6 ± 0.2         0.4 ± 0.2
F[O III] [10⁻¹⁷ erg s⁻¹ cm⁻²]     10.4 ± 0.7        4.3 ± 0.2
F_Hβ [10⁻¹⁷ erg s⁻¹ cm⁻²]         1.5 ± 0.8         0.9 ± 0.3
F[O II] [10⁻¹⁷ erg s⁻¹ cm⁻²]      6.5 ± 1.7         1.9 ± 0.6
Be ace 67.5 + 3.4% <11 
[10-*°ergs-tem-? A-}] 

obs 
Bora 89.2 + 4.6% < 18 
[10-7°erg s~tem-? A}] 

obs 
Bo 212.3 + 10.64 < 4.5 
[10— = s—tem-? — 
Extended Data Table 2 | HST/WFC3 and Subaru/MOIRCS observations

Instrument       Date                         Time (direct imaging)   Time (spectroscopy)
                                              (hr)                    (hr)

HST/WFC3         2010, 6th June               0.3 (F140W)             2.7
HST/WFC3         2010, 25th June, 1st July    0.6 (F140W)             7
HST/WFC3         2010, 9th July               0.3 (F140W)             a7
HST/WFC3         2013, 20th May               3.3 (F105W)             -
HST/WFC3         2013, 20th May               0.3 (F606W)             -
Subaru/MOIRCS    2013, 7th - 9th April        -                       3

©2015 Macmillan Publishers Limited. All rights reserved 




doi:10.1038/nature14368 


Curtain eruptions from Enceladus’ south-polar terrain


Joseph N. Spitale, Terry A. Hurford, Alyssa R. Rhoden, Emily E. Berkson & Symeon S. Platts


Observations of the south pole of the Saturnian moon Enceladus
revealed large rifts in the south-polar terrain, informally called
‘tiger stripes’, named Alexandria, Baghdad, Cairo and Damascus
Sulci. These fractures have been shown to be the sources of the
observed jets of water vapour and icy particles and to exhibit
higher temperatures than the surrounding terrain. Subsequent
observations have focused on obtaining close-up imaging of this
region to better characterize these emissions. Recent work examined
those newer data sets and used triangulation of discrete jets to
produce maps of jetting activity at various times. Here we show
that much of the eruptive activity can be explained by broad,
curtain-like eruptions. Optical illusions in the curtain eruptions
resulting from a combination of viewing direction and local fracture
geometry produce image features that were probably misinterpreted
previously as discrete jets. We present maps of the total
emission along the fractures, rather than just the jet-like component,
for five times during an approximately one-year period in 2009
and 2010. An accurate picture of the style, timing and spatial
distribution of the south-polar eruptions is crucial to evaluating
theories for the mechanism controlling the eruptions.

At high resolution, the material emitted by Enceladus’ south-polar
fractures is not adequately represented by discrete ‘jets’, as was done in
previous studies. Rather, much of the emission seen in the images
appears as broad vertical curtains extending over many kilometres of
fracture, with scale heights of a few kilometres. Intermittent prominent
jets with much larger scale heights, and thus greater visibility, appear to
be superimposed on the curtain structure. A recent study used
triangulation of jet-like features among multiple Cassini images to map
eruptive activity, but did not specify how, or whether, the ubiquitous
curtain eruptions were treated.

We examined the eruptions by simulating the appearance of curtains
of material emanating from extended sources on the surface, as
demonstrated in Fig. 1. Figure 1a shows Cassini image N1671564077
taken on 2010-354 (year-day; upper image), and a synthetic image in
the same viewing geometry with rays projected vertically from sample
points on Baghdad and Damascus Sulci spaced 250 m apart (lower
image). The rays have uniform ground-level intensities and a uniform
scale height of 4 km. The camera scale is about 750 m per pixel. The
shadow of Enceladus is visible on the synthetic curtains and compares
well with shadows in the image, demonstrating a correspondence
between the observed plume material and the mapped fractures. No
other visible surface features produce curtains that match the pattern
seen in the image.

Despite the uniform sampling of the curtains in Fig. 1a, which is well
below the projected camera scale, the simulated curtains display an
array of fine jet-like features similar in appearance to fine structure
seen in the Cassini image. Moreover, some of the broader structures
line up between the real and synthetic images. These ‘phantom’ jets
arise from variations in curtain optical thickness caused by the
meanders in the source fractures relative to the line of sight. The features in
the synthetic image are all phantoms; the correspondence with the real
image suggests that much of the discrete structure in that image is also
illusory.
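
A minimal sketch of how phantoms can arise from a uniformly erupting, meandering fracture is given below. It is not the IDL software used for the analysis: the sinusoidal trace, its amplitude and wavelength, and the purely ground-plane projection (no altitude dependence, no shadowing, optically thin summation) are all assumptions chosen for illustration. Only the 250 m sample spacing and the 750 m pixel scale are taken from the text.

```python
import numpy as np


def fracture_trace(length_km=60.0, amplitude_km=1.5, wavelength_km=20.0,
                   spacing_km=0.25):
    """Sample a hypothetical sinusoidal ground trace at uniform arc length."""
    x_fine = np.linspace(0.0, length_km, 20001)
    y_fine = amplitude_km * np.sin(2.0 * np.pi * x_fine / wavelength_km)
    steps = np.hypot(np.diff(x_fine), np.diff(y_fine))
    arc = np.concatenate(([0.0], np.cumsum(steps)))
    s = np.arange(0.0, arc[-1], spacing_km)
    return np.interp(s, arc, x_fine), np.interp(s, arc, y_fine)


def cross_sight_coordinate(x, y, view_azimuth_deg):
    """Ground position measured perpendicular to the line of sight."""
    phi = np.radians(view_azimuth_deg)
    return -x * np.sin(phi) + y * np.cos(phi)


def projected_brightness(x, y, view_azimuth_deg, pixel_km=0.75):
    """Number of identical emitters falling in each image column."""
    u = cross_sight_coordinate(x, y, view_azimuth_deg)
    edges = np.arange(u.min(), u.max() + pixel_km, pixel_km)
    counts, _ = np.histogram(u, bins=edges)
    return counts


def tangent_points(x, y, view_azimuth_deg):
    """x-positions where the line of sight is tangent to the trace."""
    du = np.diff(cross_sight_coordinate(x, y, view_azimuth_deg))
    turns = np.nonzero(np.sign(du[:-1]) != np.sign(du[1:]))[0] + 1
    return x[turns]


x, y = fracture_trace()
counts = projected_brightness(x, y, view_azimuth_deg=15.0)
print("brightest / median column:", counts.max() / np.median(counts))
print("tangent x (km), 10 deg view:", np.round(tangent_points(x, y, 10.0), 1))
print("tangent x (km), 20 deg view:", np.round(tangent_points(x, y, 20.0), 1))
```

Every emitter has the same intensity, yet the columns where the sight line runs tangent to the trace collect many more emitters than their neighbours, and the tangent positions move when the viewing azimuth changes.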

Figure 1b shows a more striking example of the phantom jet
phenomenon (top, observation; bottom, synthetic image). The scale height
is 1 km and the image scale is 80 m per pixel. In this case, nearly every
prominent bright feature in the Cassini image corresponds to a
phantom jet in the simulated curtains. Finer-scale image features are
seen where finer-scale phantoms occur in the synthetic image, though
aliasing is a possibility at this image scale.

The presence of phantoms raises additional concerns about
representing the observed activity in terms of discrete jets only and using
triangulation to locate their sources. First, as shown in Fig. 2, the
apparent source location of a phantom depends on the viewing angle,
so triangulation is inappropriate. Attempting to triangulate a phantom



Figure 1 | Curtain simulations illustrating the phantom jet phenomenon. 
a, Top, Cassini image N1671564077 stretched and cropped to focus on jet 
curtains from Baghdad and Damascus Sulci. Bottom, synthetic image in the 
same geometry with uniform jet sheets from Baghdad and Damascus Sulci. The 
high-frequency brightness variations in the simulated image are phantom jets. 
Some larger image structures may also be phantoms. b, Top, Cassini image 
N1637461416 stretched and cropped to make the erupted material visible. 
Bottom, simulated uniform curtains overlain on the unstretched image. 
Phantom jets in the synthesized curtains correspond well to regions of 
enhanced brightness in the image. 
Figure 2 | Schematic illustrating the dependence of phantom jet locations
on viewing angle. Assuming the fracture is erupting uniformly, images 
obtained from directions A, B and C will show phantoms at locations a, b and c 
respectively, where the sight lines are tangential to the fracture. Triangulation 
among the various combinations of sight lines produces solutions at locations 
ab, ac and bc, where the respective ground tracks intersect. Because the 
apparent location of a phantom depends on the viewing angle, attempts to 
triangulate phantoms produce spurious solutions whose accuracies diminish 
with increasing parallax. In contrast, the precision of triangulation of stationary 
objects increases with increasing parallax. 


produces a spurious solution whose accuracy diminishes as the
parallax increases. Such solutions are most accurate at low divergence
angles, where triangulation is least precise. Second, there is no proof
of enhanced emission at any of these locations, so the presence of
phantoms is likely to bias the outcome: activity may be preferentially
reported where phantoms appear.

For our maps, we identified active fractures by simulating curtains 
originating along obvious candidate fractures to compare with the 
emission seen in Cassini images. The best information came from 
the shadows cast on the erupted material rising into the sunlight 
because they provide a unique constraint on the location of the fracture 
responsible for the emission, though there were many cases where it 
was possible to identify the source fracture by looking at the emission 
on the sky, or because the fracture itself was visible and emitting. 
Unique determination of relative eruptive intensities along a fracture 
is not usually feasible because phantom jets and local variations in 
eruptive intensity often look the same. This approach captures both 
the jet-like and curtain-like eruptions because discrete jets are a special 
case of the curtain geometry. 

On days 2010-138 and 2010-354, when the viewing direction was
within ~20° of the fracture direction, simple collimated vertical
curtains were not adequate to explain the observations. On Baghdad
Sulcus on both days, a curtain spreading with an opening angle of
about ±10° gave a better match to the observed shadow edge and to
the observed shape of the curtains (Fig. 3) than the collimated
curtain did. On Damascus Sulcus on day 2010-138, a spreading of ±14°
with a zenith angle of about 8° was required. On Cairo Sulcus on day
2010-138, a spreading of ±7° with a zenith angle of about 3° was
required. Other viewing geometries were not sensitive to these
parameters.

The observed geometry variations among the curtains may reflect 
differences in the vent/reservoir geometries. For example, fractures 
may not be perfectly vertical, or reservoir depths and geometries 
may vary. Assuming that the observed curtain spreading reflects the 
gas dispersion, a deeper reservoir would allow particles more time to 
Figure 3 | Shadow anomalies and spreading curtains. Left-hand column,
reference images are shown with no overlays. Middle column, simulated
vertical collimated curtains are overlain. Right-hand column, simulated
curtains with the indicated spreading and zenith angles are overlain. On
Baghdad and Damascus Sulci, the shapes of the shadows are more consistent
with a spreading curtain than with a collimated curtain. In every case, the shape of
the observed curtain is consistent with spreading with altitude. On Damascus
Sulcus, the horizontal shadow right of centre is consistent with a fracture not
seen in the base map, but which appears in a different Cassini image (Extended
Data Fig. 1).


interact with, and be affected by, the gas before exiting into space; a
greater liquid surface area relative to chamber volume would produce
higher gas velocities for a given temperature (that is, dispersion
velocity).

Figure 4a–e shows maps of fracture activity at five times during an
interval of about one year (day given at top right of each panel). We
tested every plausible fracture in the base map and recorded whether
the observed pattern of emission is consistent with only one candidate
fracture (in which case the fracture is coloured green), or activity on a
fracture cannot be determined (blue), or activity from a given fracture
would produce a pattern of emission that is not detected in the image
(red). The fracture mapped in cyan could not usually be distinguished
from the nearby parallel fracture. Because of changing image
parameters and geometry, and varying shadow heights, the quantitative
detection limits implied for locations mapped in red vary. Fractures
that were never determined to be active are not included in the maps.
Because our test locations were chosen from a finite set of candidates
corresponding to features in the base map, the uncertainties in those
locations correspond to and are correlated with the uncertainties in the
base map, so we do not display those uncertainties over the base map.

We produced an averaged activity map (Fig. 4f) by dividing the 
number of times each location was determined to be active (green) by 
the total number of times a unique determination could be made (green 
Figure 4 | Stereographic projections of the south-polar terrain showing
activity at five different times, and the average. a–e, Green areas are active,
red areas are inactive, and blue areas are undetermined. One possible
alternative segment on Baghdad Sulcus was mapped using cyan. f, Averaged


or red). The alternative fracture mapped in cyan was excluded from the 
average. As expected, we find that the bulk of the emission originates on 
the four main tiger stripes and their major branches. The hottest regions 
detected by the Cassini Composite Infrared Spectrometer (CIRS; figure 3 
in ref. 6) are generally the areas in our averaged map that show the 
largest number of activity counts, though a cooler segment on Cairo 
Sulcus on the trailing hemisphere shows high activity counts in our 
averaged map and some warm areas were not observed to be active in 
any of our maps. However, given the small number of samples, those 
discrepancies may not be statistically significant. We also saw activity on 
fractures where no thermal anomaly is apparent in ref. 6. In particular, a 
activity map, where the plotted value at each point is proportional to the total
number of times activity was detected divided by the total number of times a
determination could be made. Lighter colours represent larger values. Source
locations from ref. 3 are indicated with white circles.


lengthy network of fractures extending from Baghdad Sulcus towards 
Cairo Sulcus was active on day 2010-225. 
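
The averaging rule used for Fig. 4f can be written compactly. The sketch below is illustrative only: the label strings and the function name are hypothetical, and returning `None` for a location with no unique determination is our choice, not something specified in the text.

```python
from collections import Counter


def averaged_activity(labels):
    """Fraction of determinations in which a location was active.

    labels: one entry per epoch, each 'active' (green), 'inactive' (red)
    or 'undetermined' (blue). Undetermined epochs are ignored.
    """
    counts = Counter(labels)
    determined = counts["active"] + counts["inactive"]
    if determined == 0:
        return None  # no unique determination was ever possible
    return counts["active"] / determined


# Five epochs for one hypothetical sample point.
print(averaged_activity(
    ["active", "inactive", "undetermined", "active", "active"]))
```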

The curtain representation for Enceladus’ south-polar eruptions
explains the broad emission seen in the images that cannot be
modelled using discrete sources alone. The phantom jet phenomenon that
emerges from that representation demonstrates that there are far fewer
localized eruption sources along the tiger stripes than previously
thought. Although prominent discrete jets with significant non-nadir
zenith angles are still required to explain the activity seen in some
images (for example, N1669800820), other images (for example,
Fig. 1b) can be explained with almost no discrete jets.


7 MAY 2015 | VOL 521 | NATURE | 59 


METHODS

Data reduction. For this investigation, we used images from five Enceladus
fly-bys, on days 2009-325, 2010-138, 2010-225, 2010-334 and 2010-354. The images
were all taken in the clear filters and had camera scales ranging from ~1,500 m per
pixel to ~50 m per pixel and phase angles greater than 142°. Images were
radiometrically calibrated using version 3.6 of the Cassini CISSCAL (Cassini Imaging
Science Subsystem Calibration) software, and where possible images were
co-added to improve the signal relative to the noise. Pointing corrections were carried
out using landmarks on the satellite; either the limb, or known features on the
surface. The satellite was modelled as an ellipsoid with radii 256.6, 251.4 and 248.3
km (ref. 1).

Identifying active fractures. The procedure for simulating the curtains was as
follows. First, candidate fractures were digitized by selecting a set of surface points
on a base map (we used EN_120828_DLR_south_equator) and re-sampling them
to a 250-m spacing. Finer spacings were too CPU/memory intensive. For the few
images with pixel scales finer than that, we were careful not to over-interpret
fine-scale phantoms. Next, we produced a simulated image using the geometry of a real
comparison image. To produce the simulated image, we projected a ray from each
sample location on the surface of Enceladus. The intensity of the ray decreased
exponentially with altitude, and points that were shadowed by Enceladus were
assigned zero intensity.
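
The per-ray intensity rule described above can be sketched as follows. This is not the IDL code used for the analysis. It assumes a spherical body with the mean of the three quoted radii (the analysis itself used an ellipsoid), a cylindrical shadow cast directly away from a distant Sun, and a default scale height of 4 km taken from the Fig. 1a example; all function names are hypothetical.

```python
import numpy as np

# Assumed mean radius in km (average of 256.6, 251.4 and 248.3).
R_KM = 252.1


def in_shadow(points, sun_dir, radius=R_KM):
    """True where body-centred points lie inside the cylindrical shadow."""
    s = sun_dir / np.linalg.norm(sun_dir)
    along = points @ s                      # component towards the Sun
    perp = np.linalg.norm(points - np.outer(along, s), axis=1)
    return (along < 0.0) & (perp < radius)  # anti-sunward and behind the disc


def ray_intensity(surface_unit, altitudes_km, sun_dir,
                  scale_height_km=4.0, ground_intensity=1.0, radius=R_KM):
    """Intensity along a vertical ray from one surface sample point."""
    points = (radius + altitudes_km)[:, None] * surface_unit[None, :]
    intensity = ground_intensity * np.exp(-altitudes_km / scale_height_km)
    intensity[in_shadow(points, sun_dir, radius)] = 0.0
    return intensity


# Ray from the south pole, Sun slightly north of the equatorial plane.
altitudes = np.array([0.5, 2.0, 8.0])
pole = np.array([0.0, 0.0, -1.0])
sun = np.array([1.0, 0.0, 0.1])
print(ray_intensity(pole, altitudes, sun))
```

In this geometry the lowest sample is still inside the shadow and is zeroed, while the higher samples are sunlit and follow the exponential fall-off.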

The mapping from Enceladus’ shadow on a curtain to the surface trace of the
source fracture is effectively unique. The shadow is a cone formed by the projection
of Enceladus’ terminator into space directly away from the Sun. Each point on the
surface of the satellite maps to a unique point on the surface of the shadow cone by
way of a ray perpendicular to the satellite surface at that point. Because none of the
images used in this study were taken during eclipse, rays projected from the
camera that do not hit the satellite intersect the shadow cone at two points,
implying emission from two possible surface locations. However, there is nearly
always only one reasonable choice for the true source; the other choice is usually so
much closer to the camera that the shadow height rules out any reasonable
possibility of detecting any emission. In this study, there were no cases where that
degeneracy could lead to any confusion.

Non-ideal curtain geometries. On days 2010-138 and 2010-354, the shadow cast
on the curtain above Baghdad Sulcus was a poor match to shadows on ideal
simulated curtains. We obtained a better match to those shadows by assuming
the curtain spread with altitude with a ±10° spreading angle (Fig. 3a and b).
Spreading was modelled by projecting two collimated curtains from a fracture
to represent the envelope of the spread curtain. Points internal to the envelope
were not included. As shown in the figure, the 10° spreading angle is also
consistent with the observed shape of the curtains themselves in images that are sensitive
to that aspect of the curtains. Figure 3 also shows that the curtains above Damascus
and Cairo Sulci are better matched when spreading and small non-nadir zenith
angles are included. Even better matches to the shadows could be obtained by
allowing the spreading angles and zenith angles to vary along the fractures instead
of assuming a simple model with uniform parameters.

Unseen fractures. Another possible explanation for the anomalous shadows on
the curtain above Baghdad Sulcus would be emission arising from additional
fractures that are not apparent in the base map. Assuming vertical collimated
curtains, the observed shadows can be used to trace out unique surface sources
to arbitrary precision. We searched all Cassini images with surface pixel scales
finer than 500 m and phase angles less than 135° for fractures that could
produce the observed curtains near Baghdad Sulcus assuming vertical
collimated emission and found no candidates to explain the entire observed pattern,
though the base map does contain one fracture that could explain part of the
anomalous shadow on day 2010-354. Moreover, while vertical collimated
emission from unseen fractures can match the curtain shadows, it does not match the
observed shape of the curtains themselves, and it requires a different fracture to
explain the anomalous Baghdad shadow on each day; the inclusion of a 10°
spreading angle does a reasonable job of explaining both anomalous shadows
simultaneously.

An anomalous curtain shadow near Damascus Sulcus probably does imply the
presence of a fracture not visible in the base map. In Fig. 3c, the prominent
horizontal shadow near right of centre in the frame cannot be explained by any
candidate fracture visible in the base map. A search for candidate fractures
produced the result shown in Extended Data Fig. 1. Note that because the locations for
points along this fracture were inferred from the shadows instead of being selected
from visible features in the base map, uncertainties for those points are
independent of the base map and are explicitly shown in the figure. Those uncertainties were
derived from the half-width of the shadow in the image.

Sample size. No statistical methods were used to predetermine sample size. 
Code availability. The analysis in this paper was performed using IDL-based 
software that is not currently available to the public. 
Extended Data Figure 1 | Portion of image N1602274088 showing source
locations inferred from anomalous emission near Damascus Sulcus. 
Diamonds are sample points inferred from the observed curtains; uncertainties 
are shown as circles about each displayed sample point. The inferred source 
locations correspond well with features that are not apparent in the base map. 
Training and operation of an integrated neuromorphic
network based on metal-oxide memristors


M. Prezioso, F. Merrikh-Bayat, B. D. Hoskins, G. C. Adam, K. K. Likharev & D. B. Strukov


Despite much progress in semiconductor integrated circuit technology,
the extreme complexity of the human cerebral cortex, with its
approximately 10¹⁴ synapses, makes the hardware implementation
of neuromorphic networks with a comparable number of devices
exceptionally challenging. To provide comparable complexity while
operating much faster and with manageable power dissipation,
networks based on circuits combining complementary
metal-oxide-semiconductors (CMOSs) and adjustable two-terminal
resistive devices (memristors) have been developed. In such circuits,
the usual CMOS stack is augmented with one or several crossbar
layers, with memristors at each crosspoint. There have recently been
notable improvements in the fabrication of such memristive
crossbars and their integration with CMOS circuits, including first
demonstrations of their vertical integration. Separately,
discrete memristors have been used as artificial synapses in
neuromorphic networks. Very recently, such experiments have been
extended to crossbar arrays of phase-change memristive devices.
The adjustment of such devices, however, requires an additional
transistor at each crosspoint, and hence these devices are much
harder to scale than metal-oxide memristors, whose nonlinear
current-voltage curves enable transistor-free operation. Here we
report the experimental implementation of transistor-free
metal-oxide memristor crossbars, with device variability sufficiently low
to allow operation of integrated neural networks, in a simple
network: a single-layer perceptron (an algorithm for linear
classification). The network can be taught in situ using a coarse-grain variety
of the delta rule algorithm to perform the perfect classification
of 3 × 3-pixel black/white images into three classes (representing
letters). This demonstration is an important step towards much
larger and more complex memristive neuromorphic networks.

In a hybrid CMOS/memristor circuit, the CMOS subsystem con- 
tacts each wire, and hence can address each memristor on the add-on 
crossbar(s), using a specific ‘CMOL  area-distributed interface**. The 
basic idea of hybrid neuromorphic networks—so-called CrossNets’— 
is to use this opportunity to connect CMOS-implemented hardware 
models of neuron bodies with the memristive crossbar(s), whose wires 
play the parts of axons and dendrites and whose memristors mimic 
biological synapses. The simple, two-terminal, transistor-free topology 
of metal-oxide memristors may enable CrossNets to achieve extremely 
high density—much higher than that of pure-CMOS neuromorphic 
networks (including those based on CMOS-modelled memristors”*, 
floating-gate** and ferroelectric*> memory cells), and even higher than 
that of their biological prototypes. For example, a CrossNet based on a 
hybrid CMOS/memristor circuit with five layers of 30-nm-pitch cross- 
bars, two memristors per synapse, and 10⁴ synapses per neural cell
would have an areal density of about 25 million cells per square cen- 
timetre, that is, higher than that in the human cerebral cortex, at com- 
parable average connectivity’. Estimates show that such CrossNets may 
also provide comparable power efficiency, at a much higher operation 
speed: for example, an intercell signal transfer delay of about 0.02 ms
(compared to about 10 ms in biological systems) at an easily manageable
energy dissipation rate of about 1 W cm⁻².
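
The density estimate above can be checked with a few lines of arithmetic. The sketch below is only a back-of-the-envelope reading of the quoted figures: it assumes that every crosspoint occupies one pitch × pitch cell and takes 10⁴ synapses per neural cell, the value that reproduces the quoted order of magnitude. The function and parameter names are illustrative.

```python
# Back-of-the-envelope check of the CrossNet areal density quoted in the text.
# Assumption (not stated in the text): one crosspoint per pitch x pitch cell.

NM_PER_CM = 1e7  # nanometres in one centimetre


def cells_per_cm2(pitch_nm, layers, memristors_per_synapse, synapses_per_cell):
    """Return the number of neural cells per square centimetre."""
    crosspoints_per_layer = (NM_PER_CM / pitch_nm) ** 2
    memristors = crosspoints_per_layer * layers
    synapses = memristors / memristors_per_synapse
    return synapses / synapses_per_cell


density = cells_per_cm2(
    pitch_nm=30,
    layers=5,
    memristors_per_synapse=2,
    synapses_per_cell=1e4,
)
print(f"{density:.2e} cells per cm^2")
```

Under these assumptions the result is a few tens of millions of cells per square centimetre, in line with the figure of about 25 million given above.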

However, the practical implementation of such networks is still very 
challenging, owing to the specific physical mechanism of resistance 
change in most prospective metal-oxide-based memristors—a 
reversible modulation of the concentration profile of oxygen 
vacancies. On the positive side, the atomic scale of the vacancy
position modulation implies the possibility of memristor scaling down 
to few-nanometre dimensions, which has been confirmed by recent 
experiments**”’. On the negative side, such a small scale makes the 
device-to-device reproducibility of device parameters, most impor- 
tantly the voltage required for memristor electroforming and 
switching”””', difficult to achieve with the currently used fabrication 
technologies. Device variability is the main reason why the only 
(to our knowledge) demonstrations of memristive neuromorphic 
networks were based on disconnecting each memristor from the 
crossbar for individual forming, using either a crossbar with external 
(off-chip) wires’*, or an individual switch transistor at each cross- 
point’’. Both these approaches are incompatible with the goal of 
reaching the extremely high density of neuromorphic networks dis- 
cussed above. 

The main goal of this work was an experimental demonstration of a 
fully operational neural network based on an integrated, transistor- 
free crossbar with metal-oxide memristors. To reach this goal, a large 
reduction of memristor variability was essential, and to achieve it, we 
used binary-oxide Al₂O₃/TiO₂₋ₓ stacks (see inset to Fig. 1b). Their
fabrication procedure was generally close to that described in ref. 27, 
but with the important difference of using low-temperature (<300 °C) 
reactive sputtering for film deposition, which enables monolithic 
three-dimensional integration. The stack was first optimized by con- 
ducting an exhaustive experimental search over a range of titanium 
dioxide compositions and layer thicknesses (from 5 nm to 100 nm) to 
find the parameter range providing the lowest forming voltages. 
Within that range, the device performance—most importantly the 
memristor uniformity and the current-voltage curve nonlinearity— 
was further optimized by varying the aluminium oxide thickness from 
1 nm to 5 nm (Supplementary Information Section 1). 

The main feature of such optimized junctions is their low variability 
(Supplementary Figs 3 and 4). In addition, other important characteristics
of the 200 nm × 200 nm formed devices are also desirable: the
ON/OFF current ratios of above four orders of magnitude (at +0.1 V), 
high nonlinearity of the current-voltage curves (with the current at the 
switching voltage more than ten times the current at half of the switch- 
ing voltage), a switching endurance of at least 5,000 cycles, an esti- 
mated memory retention of at least ten years at room temperature, 
low forming (~2 V) and switching (~1.5 V) voltages, and relatively
low operation currents of between ~100 nA and ~100 µA (see
Supplementary Fig. 1). 

The optimized technology was then used to fabricate an integrated 
memristive crossbar with 12 × 12 devices (Fig. 1), with a few process
Figure 1 | Memristor crossbar. a, Integrated 12 × 12 crossbar with an Al₂O₃/TiO₂₋ₓ
memristor at each crosspoint. b, A typical current-voltage curve of a
formed memristor. c, Absolute values of conductance change under the effect of


modifications to increase the metal electrode thickness, so that the line 
resistances were reduced to about 800 Ω for the top layer of the crossbar
and 600 Ω for its bottom layer. The crossbars retained the excellent
uniformity of virgin (pre-formed) crossbar-integrated devices (see 
Supplementary Figs 3, 4 and 5), allowing individual electric forming 
and tuning of each memristor. The electroforming was performed by 
grounding the corresponding bottom electrode and applying a cur- 
rent-controlled ramp-up to the top electrode, while leaving all other 
line potentials floating (Supplementary Fig. 4). To minimize current 
leakage during the subsequent forming of other devices, each formed 
memristor was immediately switched into its low-current (OFF) state. 
The measured individual characteristics of the formed memristors 
were mostly similar to those of stand-alone devices, except for a some- 
what smaller (~100) ON/OFF current ratio. This difference may be 
partly explained by current leakage through other crosspoints during the
measurements, and partly by the somewhat smaller switching voltages
used for the crossbar to lower the risk of device damage. In addition, 
some deviations from the optimal device performance could be 
caused by the electron-beam evaporation of thicker electrodes, which 
required breaking of the vacuum, as opposed to the fully in situ sput- 
tering of single device layers, and their subsequent annealing (see 
Supplementary Information). 

The fabricated memristive crossbar was used to implement a simple 
artificial neural network with the top-level (functional) scheme shown 
in Fig. 2. This is a single-layer perceptron” with ten inputs and three 
outputs, fully connected with 10 × 3 = 30 synaptic weights (Fig. 2b).


for various pulse amplitudes. The inset in b shows the device cross-section
schematically. 


As the scheme shows, the perceptron’s outputs $f_i$ (with $i = 1, 2, 3$) are
calculated as nonlinear ‘activation’ functions:

$$f_i = \tanh(\beta I_i) \qquad (1)$$

of the vector-by-matrix product components:

$$I_i = \sum_{j=1}^{10} W_{ij} V_j \qquad (2)$$

Here $V_j$ with $j = 1, \ldots, 9$ are the input signals, $V_{10}$ is a constant bias, $\beta$
is a parameter controlling the function’s nonlinearity, and $W_{ij}$ are
adjustable (trainable) synaptic weights. Such a network is sufficient 
for performing, for example, the classification of 3 × 3-pixel black-
and-white images into three classes, with nine network inputs
($V_1, \ldots, V_9$) corresponding to the pixel values. We tested the network
on a set of $N = 30$ patterns, including three stylized letters (‘z’, ‘v’
and ‘n’) and three sets of nine noisy versions, one set per letter, formed
by flipping one of the pixels of the original image (see Fig. 2c).
Because of the very limited size of the set, it was used for both training 
and testing. 
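
As an illustration of how such a 30-pattern set is assembled, the sketch below builds it from three 3 × 3 bitmaps. The bitmaps are hypothetical stand-ins: the exact letter shapes used in the experiment are those of Fig. 2c and are not reproduced here. Only the construction (each letter plus its nine single-pixel flips) follows the text.

```python
# Hypothetical 3x3 bitmaps (1 = black, 0 = white), row by row.
# These shapes are assumptions; the experiment used the images of Fig. 2c.
LETTERS = {
    "z": [1, 1, 0,
          0, 1, 0,
          0, 1, 1],
    "v": [1, 0, 1,
          1, 0, 1,
          0, 1, 0],
    "n": [1, 1, 1,
          1, 0, 1,
          1, 0, 1],
}


def noisy_versions(image):
    """Return the nine images obtained by flipping exactly one pixel."""
    versions = []
    for k in range(len(image)):
        flipped = list(image)
        flipped[k] = 1 - flipped[k]
        versions.append(flipped)
    return versions


def build_pattern_set(letters):
    """Return (pixels, class_index) pairs: each letter and its noisy versions."""
    patterns = []
    for class_index, image in enumerate(letters.values()):
        patterns.append((list(image), class_index))
        patterns.extend((v, class_index) for v in noisy_versions(image))
    return patterns


PATTERNS = build_pattern_set(LETTERS)
print(len(PATTERNS))  # 3 letters x (1 original + 9 noisy) = 30
```

Each class therefore contributes ten patterns, and every noisy pattern differs from its parent letter in exactly one pixel.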

Physically, each input signal was represented by a voltage $V_j$ equal to
either +0.1 V or −0.1 V, corresponding, respectively, to the black or
white pixel, while the bias input $V_{10}$ was equal to −0.1 V. Such coding
makes the benchmark input set balanced, in particular ensuring that 
the sum of all input signals across all patterns of a particular class 
is close to zero, which speeds up the convergence process”. To 
sustain this balance at the network’s output as well, each synapse 
Figure 2 | Pattern classification experiment (top-level description). a, Input
image. b, The single-layer perceptron for classification of 3 × 3 binary images.
c, The input pattern set used. d, The flow chart of one epoch of the in situ
training algorithm used. In d, the grey-shaded boxes show the steps implemented
inside the crossbar, while those with solid black borders denote the only steps 
required to perform the classification operation. 
Figure 3 | Pattern classification experiment (physical-level description).
a, An implementation of a single-layer perceptron using a 10 × 6 fragment of
the memristive crossbar. b, An example of the classification operation for a
specific input pattern (stylized letter ‘z’), with the crossbar input signals equal to
$+V_R$ or $-V_R$, depending on the pixel colour. (The read and write biases were
Figure 4 | Pattern classification experiment: results. a, Convergence of
network outputs, during the training process, to the perfect value (zero), for six 
training runs from different initial states. b, The evolution of output signals, 
averaged over all patterns of a specific class. The inset in a shows the 
distribution of weights W in the initial state and immediately after epoch 21, 
when perfect classification is achieved for the first time for this particular run. 
The classification was considered successful when the output signal $f_i$


was implemented with two memristors, so that the total number of 
memristors in the crossbar was 30 × 2 = 60. Using external electronics
to enforce the virtual ground conditions on each column line, and to
subtract currents flowing in the adjacent columns to form a differential
output signal $I_i$, we ensured that Ohm’s law applied to each column
of the crossbar gave a result identical to equation (2), with differential
weights:


$$W_{ij} = G_{ij}^{+} - G_{ij}^{-} \qquad (3)$$


where $G_{ij}^{\pm}$ is the effective conductance of each memristor, namely the
I/V ratio at voltage 0.1 V. For our devices, these effective conductances
were in the range 10-100 µS, so that currents $I_i$ were of the order of a
few microamperes. Activation functions (see equation (1)) were
also implemented, using external electronics, with the slope parameter
$\beta = 2 \times 10^{5}$ A⁻¹ chosen according to the recommendation in ref. 28,
confirmed by our own computer simulations (Supplementary Fig. 10).
The network was trained in situ, that is, without using its external 
computer model, using the Manhattan update rule”, which is essen- 
tially a coarse-grain, batch-mode variation of the usual delta rule of 
supervised training’. At each iteration (‘epoch’) of this procedure, 
sketched in Fig. 2d, patterns from the training set were applied, 
one by one, to the network’s input, and its outputs $f_i(n)$, where $n$ is
always $V_R$ = 0.1 V and $V_W$ = ±1.3 V, respectively.) c, An example of the
weight adjustment in a specific (first positive) column, for a specific error 
matrix. At the step shown, only the synapses whose weights should be increased 
(marked by ‘+’ in the table on the left) are adjusted, that is, the memristor 
conductances Gi’, Gia’; Gis'> Gig* and Gis” are being increased. 
corresponding to the correct class of the applied pattern was larger than all
other outputs. Such perfect classification was achieved, on average, after 23 
epochs, with the standard deviation of ten epochs. The training illustrated by 
b was continued even after the perfect classification had been achieved on 
epoch 21, to verify that the difference between the output signals continued to 
increase (unlike the ‘perceptron rule’ training used in ref. 18). 


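the pattern number, were used to calculate the delta-rule weight
increments:

$$\Delta_{ij}(n) = \delta_i(n)\, V_j(n)$$

with

$$\delta_i(n) = \left[ f_i^{(g)}(n) - f_i(n) \right] \left. \frac{df}{dI} \right|_{I = I_i(n)} \qquad (4)$$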


Here $f_i^{(g)}(n)$ is the target value of the $i$th output for the $n$th input
pattern. (In our system these values were chosen to be +0.85 for the
output corresponding to the correct pattern class, and −0.85 for the
outputs corresponding to the wrong classes.) Once all $N$ patterns of the
training set had been applied, and all $\Delta_{ij}(n)$ calculated, the synaptic
weights were modified using the following Manhattan update rule:


$$\Delta W_{ij} = \eta \, \mathrm{sgn} \sum_{n=1}^{N} \Delta_{ij}(n) \qquad (5)$$


where $\eta$ is a constant that scales the training rate. (The only difference
between the Manhattan update rule and the batch-mode delta rule is
the binary quantization, expressed in equation (5) by the ‘sgn’ function,
which simplifies the hardware implementation of the delta rule.)
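
The training step described by equations (1), (2), (4) and (5) can be sketched in software as follows. This is an idealized illustration, not the in situ hardware procedure: weights are plain numbers that move by exactly ±η, whereas real memristor updates depend on the device state. The toy data set (two pixels plus a bias, two classes) and the step size `ETA` are assumptions made for the example; the slope parameter (written `BETA` here), the ±0.1 V input coding and the ±0.85 targets are taken from the text.

```python
import math


def sgn(x):
    """Sign function of equation (5): returns -1, 0 or +1."""
    return (x > 0) - (x < 0)


def forward(W, V, beta):
    """Equations (1) and (2): I_i = sum_j W_ij * V_j, f_i = tanh(beta * I_i)."""
    currents = [sum(w * v for w, v in zip(row, V)) for row in W]
    return currents, [math.tanh(beta * c) for c in currents]


def manhattan_epoch(W, inputs, targets, beta, eta):
    """One batch epoch: accumulate Delta_ij(n) over all patterns (eq. 4),
    then step every weight by eta in the direction of the summed sign (eq. 5)."""
    acc = [[0.0] * len(W[0]) for _ in W]
    for V, target in zip(inputs, targets):
        _, f = forward(W, V, beta)
        for i in range(len(W)):
            # For f = tanh(beta * I), df/dI = beta * (1 - f^2).
            delta_i = (target[i] - f[i]) * beta * (1.0 - f[i] ** 2)
            for j in range(len(V)):
                acc[i][j] += delta_i * V[j]
    return [[w + eta * sgn(a) for w, a in zip(w_row, a_row)]
            for w_row, a_row in zip(W, acc)]


# Toy example (hypothetical): 2 pixels + bias input, 2 output classes.
BETA = 2e5   # A^-1, slope parameter
ETA = 1e-6   # S, illustrative step size (an assumption)
INPUTS = [[+0.1, -0.1, -0.1],
          [-0.1, +0.1, -0.1]]
TARGETS = [[+0.85, -0.85],
           [-0.85, +0.85]]
W0 = [[0.0, 0.0, 0.0],
      [0.0, 0.0, 0.0]]

W1 = manhattan_epoch(W0, INPUTS, TARGETS, BETA, ETA)
for row in W1:
    print(row)
```

In this toy case the bias column receives equal and opposite contributions from the two patterns, so its summed increment is zero and the weight is left unchanged, while every other weight moves by one fixed step regardless of the size of the accumulated error.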
Physically, in our system the weights were modified in parallel for
each column of the crossbar (corresponding to a certain value of index
$i$ in the above formulas), using two sequential voltage pulses. Namely,
first a ‘set’ pulse with amplitude $V_W^{+}$ = 1.3 V was applied to increase
the conductances of the synapses whose ΔG values, calculated from
equation (5), were positive; then a ‘reset’ pulse $V_W^{-}$ = −1.3 V was
applied to the remaining synapses of that column (see Fig. 3c). This 
fixed-amplitude pulse procedure followed the Manhattan update rule 
only approximately, because the actual training rate ΔG depends on
the initial conductance G of the memristor (see Fig. 1c and
Supplementary Fig. 6). (For G = 20 µS, ΔG was close to +60 µS for
the set pulse and −5 µS for the reset pulse, while for G = 65 µS, the
changes were close, respectively, to +24 µS and −55 µS.) Owing to the
specific (though quite representative’) switching dynamics of our 
devices, the best classification performance was achieved when the 
memristors had been initialized somewhere in the middle of their 
conductance range, around 35 µS (Supplementary Fig. 7b). At such
initialization, the perfect classification was reached, on average, after 
23 training epochs (see Fig. 4). 
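
To see why fixed-amplitude pulses follow the Manhattan rule only approximately, the state dependence of ΔG can be mimicked with a crude model. The sketch below linearly interpolates between the two data points quoted above and clips the result to the 10-100 µS range; both the linear form and the clipping are assumptions made purely for illustration, and the measured dependence (Fig. 1c, Supplementary Fig. 6) need not be linear.

```python
# Quoted data points, all in microsiemens:
# initial conductance G -> (change after set pulse, change after reset pulse)
POINTS = {20.0: (+60.0, -5.0), 65.0: (+24.0, -55.0)}
G_MIN, G_MAX = 10.0, 100.0  # effective conductance range quoted in the text


def pulse_change(g, kind):
    """Conductance change for a 'set' or 'reset' pulse at initial conductance g.
    Linear interpolation between the two quoted points (an assumption)."""
    (g0, c0), (g1, c1) = sorted(POINTS.items())
    idx = 0 if kind == "set" else 1
    slope = (c1[idx] - c0[idx]) / (g1 - g0)
    return c0[idx] + slope * (g - g0)


def apply_pulse(g, kind):
    """Return the conductance after one pulse, clipped to the allowed range."""
    return min(G_MAX, max(G_MIN, g + pulse_change(g, kind)))


for g in (20.0, 35.0, 65.0):
    print(g, round(pulse_change(g, "set"), 1), round(pulse_change(g, "reset"), 1))
```

In this toy model the set and reset steps are of comparable magnitude only near the middle of the conductance range, which is qualitatively in line with the observation that initialization around 35 µS worked best.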

In summary, here we have experimentally demonstrated an artificial 
neural network using memristors integrated into a dense, transistor- 
free crossbar circuit. This crossbar performed, on the physical (Ohm’s 
law) level, the analogue vector-by-matrix multiplication of equations 
(2) and (3), which is by far the most computationally intensive part of 
the operation of any neuromorphic network used repeatedly in the 
same environment. The other operations, described by equations (1), 
(4) and (5), were performed by external electronics, but they are much 
less critical for network performance, and in future, larger CrossNets 
may be (at least partly) assisted by CMOS subsystems. This is an 
important step towards the effective analogue-hardware implementa- 
tion of much more complex neuromorphic networks, from multilayer- 
perceptron classifiers with deep learning” to elaborate CrossNet-based 
cognitive systems. Recent experiments” with similar but smaller (dis- 
crete) devices imply that such circuits may be scaled down to devices of 
30 nm across or less, that is, to networks with a density of approximately
10¹⁰ synapses per square centimetre in each crossbar layer.
Internal gravity waves, the subsurface analogue of the familiar
surface gravity waves that break on beaches, are ubiquitous in 
the ocean. Because of their strong vertical and horizontal currents, 
and the turbulent mixing caused by their breaking, they affect a 
panoply of ocean processes, such as the supply of nutrients for 
photosynthesis’, sediment and pollutant transport” and acoustic 
transmission’; they also pose hazards for man-made structures in 
the ocean*. Generated primarily by the wind and the tides, internal 
waves can travel thousands of kilometres from their sources before 
breaking*, making it challenging to observe them and to include 
them in numerical climate models, which are sensitive to their 
effects*’. For over a decade, studies*""’ have targeted the South 
China Sea, where the oceans’ most powerful known internal waves 
are generated in the Luzon Strait and steepen dramatically as they 
propagate west. Confusion has persisted regarding their mech- 
anism of generation, variability and energy budget, however, 
owing to the lack of in situ data from the Luzon Strait, where 
extreme flow conditions make measurements difficult. Here we 
use new observations and numerical models to (1) show that the 
waves begin as sinusoidal disturbances rather than arising from 
sharp hydraulic phenomena, (2) reveal the existence of >200- 
metre-high breaking internal waves in the region of generation 
that give rise to turbulence levels >10,000 times that in the open 
ocean, (3) determine that the Kuroshio western boundary current 
noticeably refracts the internal wave field emanating from the 
Luzon Strait, and (4) demonstrate a factor-of-two agreement 
between modelled and observed energy fluxes, which allows us to 
produce an observationally supported energy budget of the region. 
Together, these findings give a cradle-to-grave picture of internal 
waves on a basin scale, which will support further improvements of 
their representation in numerical climate predictions. 

Internal gravity waves are propagating disturbances of the ocean’s 
density stratification. Their physics resembles that of surface gravity 
waves but with buoyancy rather than gravity providing their restoring 
force—making them much larger (amplitudes of tens to hundreds of 
metres instead of less than ten metres) and slower (hours instead of 
seconds). Generated primarily by tidal flow past seafloor topography 
and winds blowing on the sea surface, and typically having multi- 


kilometre-scale horizontal wavelengths, their estimated terawatt of 
deep-sea dissipation is understood to play a crucial role in the ocean’s 
global redistribution of heat and momentum”. A major challenge is to 
improve our understanding of internal wave generation, propagation, 
steepening and dissipation, so that the role of internal waves can be 
more accurately incorporated into climate models. 

The internal waves that originate from the Luzon Strait on the 
eastern margin of the South China Sea are the largest waves documen- 
ted in the global oceans (Fig. 1). As the waves propagate west from the 
Luzon Strait they steepen dramatically (Fig. 1a), producing distinctive 
solitary wave fronts evident in sun glint and synthetic aperture radar 
images from satellites (Fig. 1b). When they shoal onto the continental 
slope to the west, the downward displacement of the ocean’s layers 
associated with these solitary waves can exceed 150 m in 5 min (ref. 8). 
The scale of these waves means that they pose hazards for underwater 
navigation and offshore drilling’, and supply nutrients from the deep 
ocean that nourish coral reefs' and pilot whale populations that forage 
in their wakes”. 

Over the past decade a number of field studies have been conducted 
in the region; this work has been comprehensively reviewed'®”’. All of 
these studies, however, focused on the propagation of the internal 
waves across the South China Sea and their interactions with the 
continental shelf of China. Until the present study there have been 
no substantial in situ data gathered at the generation site of the Luzon 
Strait, in large part because of the extremely challenging operating 
conditions. A consequence has been persistent confusion regarding 
the nature of the mechanism of wave generation’ with an underlying 
cause being the sensitivity of the models employed to the system 
parameters, such as the chosen transect for a two-dimensional model, 
the linear internal wave speed or the assumed location of the waves’ 
origin within the Luzon Strait. Furthermore, the lack of in situ data 
from the Luzon Strait has meant an inability to test numerical predic- 
tions of energy budgets’ and no knowledge of the impact of the 
Kuroshio Current on the emergence of internal solitary waves". 

The goal of the Internal Waves in Straits Experiment (IWISE) is to 
obtain the first comprehensive in situ data set from the Luzon Strait, 
which in combination with high-resolution three-dimensional numer- 
ical modelling supports a cradle-to-grave picture of the life cycle of the 
world’s largest known oceanic internal waves. IWISE is the most sub-
stantial internal wave field programme since the Hawaiian Ocean 
Mixing Experiment (HOME)"* revealed how most of the tidal losses 
from a complex ocean ridge system can be radiated away as nearly 
sinusoidal waves, with a small fraction dissipated locally. IWISE is 
broader in scope, however, because the radiated internal tide is non- 
linear and therefore subject to additional steepening and wave-break- 
ing processes. A pilot study was performed in the summer of 2010 to 
determine the feasibility of operating at desired locations”, and the full 
field programme was executed throughout the summer of 2011. This 
Letter is a synthesis of the results from both programmes. 

The internal tides derive their energy from the barotropic (surface) 
tide: the flow arising from astronomical forcing of the oceans by the 
Sun and the Moon. The barotropic tides in this region are a combina- 
tion of twice-daily (semidiurnal, D2) and once-daily (diurnal, D1) 
motions, giving rise to a strong diurnal inequality and a fortnightly 
amplitude envelope caused by the interaction of lunar and solar tides 
(Fig. 1d). Our moorings, shipboard stations, and autonomous gliders 
within and to the west of the Luzon Strait reveal time-averaged westward
energy fluxes of 40 ± 8 kW m⁻¹ (Fig. 2a; see Methods). Such
values are about 100 times typical open-ocean values’ and exceed any 
other known generation site around the world'*. The measured fluxes 
agree to within a factor of two in magnitude and ±30° in direction with
the predictions of our far-field numerical model (see Methods), as well 
as previous numerical predictions’. 
Figure 1 | Overview of internal waves in the
South China Sea. a, Vertical displacement of 
ocean layers near 200 m depth from the far-field 
numerical simulation. Orange and blue indicate 
upward and downward displacements, 
respectively. b, A collage of synthetic aperture radar 
images taken on 12 August 2009, 04 August 2009 
and 25 April 2005. Wave fronts are visible because 
they focus surface waves, increasing sea surface 
roughness. Red and blue correspond to greater and 
less surface roughness, respectively. 

c, Instrumentation deployed during IWISE. The 
Kuroshio Current is sketched schematically. PIES, 
pressure inverted echo sounder. d, Time series 
(5-20 August 2011) of depth-averaged tidal 
current in Luzon Strait over a spring/neap cycle, 
showing the presence of once-daily (diurnal, D1) 
and twice-daily (semidiurnal, D2) frequencies. 
Inset, globe showing location of Luzon Strait and 
South China Sea. 
The total energy flux in the northern section of the Luzon Strait shows
a clockwise pattern between the tall east and west ridges (Fig. 2a). This 
feature exists because the double-ridge structure creates a 100-km-scale 
resonant cavity for the ~100-km wavelength semi-diurnal internal 
tide’’; the existence of this resonant cavity is confirmed by observations 
of very high energy density but little energy flux between the two ridges, 
characteristic of a standing-wave pattern’’. Tidal flow transiting the 
ridge in the vicinity of the Batanes Islands (Fig. 2f) gives rise to dramatic 
localized lee wave phenomena”. Shipboard time series of velocity and 
density, obtained at the location of the dashed lines in Fig. 2b and c, are 
presented in Fig. 2d. Observed vertical displacements of the ocean layers 
reach up to 500 m, with static instabilities >200 m high. The associated 
depth-integrated turbulent dissipation levels (Fig. 2e; see Methods) 
approach 20 W m⁻², exceeding open-ocean values by a factor of
1,000-10,000 (refs 12 and 15). The associated vertical mixing of quantities
such as temperature and salinity is sufficiently high (>10⁻¹ m² s⁻¹), also
orders of magnitude greater than open-ocean values, that it probably
plays a key part in setting large-scale circulation patterns throughout the 
South China Sea’’. 

There was no evidence of coherent nonlinear internal waves east of 
120.5° E, disproving previous conjectures that the large-amplitude 
internal solitary waves arise in the immediate vicinity of the Luzon 
Strait via a hydraulic lee wave mechanism". Rather, what emerges 
immediately to the west of the Luzon Strait is a broad, energetic, spatially 
coherent, nearly sinusoidal internal tide (Figs 1a, 2a and 3a, b) at a 
combination of semidiurnal and diurnal frequencies (Fig. 1d). The struc-
ture of the wave field is dominated by ‘mode 1’ behaviour", this being the 
fundamental vertical mode of oscillation in which velocity in the upper 
few hundred metres of the ocean is in the opposite direction to, and 
oscillates out of phase with, the velocity in the deeper ocean (Fig. 3d). The 
three-dimensional structure of the ridge system within the Luzon Strait 
shapes the radiated semidiurnal and diurnal internal tides differently 
because the horizontal wavelength of the former is half that of the latter. 
The semi-diurnal internal tide energy flux is strongest within a beam that 
emanates from the central section of the Luzon Strait, between 20° N and 
21° N (Fig. 3a), while a broader beam of diurnal energy flux emanates 
from across the central and southern sections of the Luzon Strait 
(Fig. 3b). These beams are also evident in the field data: characteristic 
diurnal energy fluxes of ~5 kW m⁻¹ and ~20 kW m⁻¹ were detected in
the central and southern sections of the Luzon Strait, respectively, and 
the corresponding values for the semi-diurnal energy fluxes were 
~20 kW m⁻¹ and ~10 kW m⁻¹ (ref. 15). These results and interpretations
were furthermore supported by a rotating, large-scale laboratory
experiment, using an accurate scale model of the Luzon Strait”®. 

The combination of mode-1-dominated semidiurnal and diurnal 
internal tides sets the initial condition for the evolution of large-ampli- 
tude solitary waves”’. Whether or not steepening occurs depends cru- 
cially on the balance between nonlinearity and rotational dispersion, 
which serve to enhance and reduce the steepness of the wavefronts, 
respectively*’”. When the semidiurnal internal tide dominates, nonli- 
nearity can overcome rotational dispersion, leading to the formation of 
Figure 2 | Near-field processes in
the Luzon Strait. a, Time-mean total 
energy flux from the near-field 
numerical model (white arrows) and 
field measurements (coloured 
arrows). LADCP, lowered acoustic 
Doppler current profiler 
measurements (see Methods). 

b, c, Snapshots from the two- 
dimensional model showing internal 
wave breaking at the location 
indicated in a by the green star, 
corresponding to times T₀ and T₁
indicated in f. Colours and lines in 
b and c indicate east-west velocity 
and density contours, respectively. 
d, Corresponding field 
measurements at the location of the 
vertical dashed line in b and
c. e, Depth-averaged dissipation rate
computed from Thorpe scales.
f, Depth-integrated eastward tidal
transport, showing the times T₀, T₁ of
the frames in b and c.
internal solitary waves around 120° E as seen in Fig. 1a, b; if diurnal
forcing dominates, however, rotational dispersion tends to suppress 
wave formation. Other factors are the fortnightly spring/neap cycle 
(Fig. 1d), which sets the overall amplitude of barotropic tidal forcing 
in the Luzon Strait, and the interaction between the semidiurnal and 
diurnal tides, which can cause alternating strong and weak wave fronts”. 

Another key influence on the radiated waves is the Kuroshio 
Current, one of the oceans’ foremost (in volume, extent and velocity) 
boundary currents, whose modelled lateral structure and velocities of 
over 0.5 m s⁻¹ were confirmed by our measurements (Fig. 4a). When it
intrudes into the Luzon Strait, the Kuroshio Current refracts the 
internal tide, profoundly affecting the subsequent propagation path 
of the internal waves. For example, we determined that the orientation 
of the semidiurnal beam was notably different during our winter 2010- 
2011 field study** compared to a previous field study in winter 2006, 
for which moorings were deployed near Dongsha Island”. No solitary 
waves were detected in winter by the latter study (Fig. 4b), so solitary 
waves were thought not to be present in the South China Sea during the 
wintertime. This is not the case, however, as solitary waves persisted 
throughout the winter of our 2010-2011 field study (Fig. 4c). Instead,
we determined that during winter 2006 the Kuroshio Current was 
responsible for directing the semidiurnal internal tide beam, and thus 
the solitary waves, south of the mooring location™*. 

Proceeding into the shallower waters of the Dongsha Plateau at 
speeds of roughly 3 m s⁻¹, the internal solitary waves begin to slow
down”. Eventually, the wave-induced fluid velocities can exceed the 
Figure 3 | Internal wave energy fluxes in the
South China Sea. a, b, Semidiurnal (a) and diurnal 
(b) energy flux from the far-field model. c, Energy 
flux along 21° N. Arrows in c indicate integrated 
energy fluxes (numbers on arrows are fluxes in 
kilowatts per metre) at 21° N in the semidiurnal 
and diurnal internal tides and in the solitary or 
nonlinear internal waves (NLIW). Flux values at 
120° E are from the near-field model; flux and 
dissipation values at 115.19° E, 117.25° E and 
117.895° E are from observations (see Methods). 
d, Bathymetry along 21° N. The processes of 
generation, breaking, propagation, steepening and
dissipation are shown schematically.
wave speed, leading to the formation of trapped cores”, wherein fluid
is carried along with the wave. At this stage, the waves vertically dis- 
place water up to 170 m, nearly 40% of the local ocean depth, in only a 
few minutes and have wavelengths of only a few hundred metres. The 
solitary waves become highly turbulent, leading to strong vertical mix- 
ing that has been postulated to contribute to the high biological pro- 
ductivity in the vicinity of the nearby Dongsha coral atoll’. We 
observed the waves to become convectively and shear unstable, pro- 
ducing vertical overturns of up to one hundred metres within the core. 
Measured turbulent dissipation rates are extreme, exceeding 1.5 ×
10⁻⁴ W kg⁻¹, contributing to a cross-shelf integrated dissipation level
of 10 kW per metre of coastline (see Methods). 

The combination of IWISE numerical models and field measure- 
ments enables us to estimate an energy budget for internal waves 
throughout the entire South China Sea, as summarized in Fig. 3c. Our 
far-field model (see Methods) predicts that, out of a total conversion of 
24 GW from the barotropic tide at the Luzon Strait, 15 GW are radiated 
to the east and west, agreeing to within a factor of two with observations; 
this suggests that around 40% of the energy is dissipated locally at the 
Luzon Strait. For comparison, our higher-resolution near-field model, 
which captured the generation region only, obtains a similarly high 
fraction of locally dissipated energy, but conversion and fluxes are about
1.8 times higher. These results suggest that the Luzon Strait is a more 
highly dissipative system than the Hawaiian Ridge, where only about 
15% of the energy is dissipated locally**. Differences between the two 
models, however, which we attribute to model resolution and their 
inability to directly resolve turbulence, emphasize that further computa- 
tional advances are still required for quantitative certainty. 
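
The locally dissipated fraction quoted above follows directly from the far-field conversion and radiation figures. The short calculation below makes the step explicit; treating the near-field model as a uniform 1.8-fold rescaling of both quantities is a simplifying assumption used only to show why its dissipated fraction comes out similar.

```python
def local_dissipation_fraction(conversion_gw, radiated_gw):
    """Fraction of converted barotropic energy that is not radiated away."""
    return (conversion_gw - radiated_gw) / conversion_gw


# Far-field model figures quoted in the text (GW).
far_field = local_dissipation_fraction(conversion_gw=24.0, radiated_gw=15.0)

# Near-field model: conversion and fluxes about 1.8 times higher
# (uniform scaling is an assumption made for this illustration).
near_field = local_dissipation_fraction(conversion_gw=1.8 * 24.0,
                                        radiated_gw=1.8 * 15.0)

print(f"far-field:  {far_field:.1%}")
print(f"near-field: {near_field:.1%}")
```

Both come out at 37.5%, that is, around 40%, compared with roughly 15% for the Hawaiian Ridge.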

The observed westward energy flux at 120° E is approximately equi- 
partitioned between semidiurnal and diurnal motions, with no energy yet 
in the sharp wavefronts. Closer to the continental slope at 117.25° E, 
however, the nonlinear internal waves (NLIW) gain energy at the expense 
of the internal tide, and energy is more equally distributed among all three 
components (D1, D2 and NLIW). At the continental slope, the diurnal 
motions are partially reflected (eastward arrows at 117.895° E), while the 
steeper semidiurnal motions and nonlinear waves are transmitted, with 
active turbulence dissipating about 3 kW m⁻¹ (ref. 29). By 115.19° E, 
strong dissipation due to wave breaking over the continental shelf has 
removed nearly all energy in the nonlinear waves and internal tides. 

The comprehensive modelling and measurements of the IWISE programme have 
resulted in a first-order energy budget and an understanding of the key 
physical processes occurring in the Luzon Strait that lead to the formation 
of the world’s strongest known internal waves. Conversion from the barotropic 
tide leads to strong internal tides accompanied by vigorous near-field 
turbulence. These internal tides radiate westward through the strong Kuroshio 
Current, steepen into sharp wavefronts, and eventually break on the 
continental shelf. In spite of this clear physical picture, some questions 
remain, including the precise fraction of energy dissipated locally and the 
role of the Kuroshio Current in modulating the wave amplitudes. To improve 
the representation of internal waves and their associated vertical mixing in 
numerical climate predictions will require more sophisticated and 
high-resolution numerical models that can successfully capture the weakly 
nonlinear, non-hydrostatic, and strongly nonlinear mechanisms”. 

Figure 4 | The Kuroshio and its impact on wave propagation. a, Observed 
(green) and modelled (grey) Kuroshio flow during June-August 2011 in the 
Luzon Strait region. The meshes are modelled phase lines of internal waves 
during February 2006 (red) and February 2011 (blue). KS model indicates the 
NRL Kuroshio model (see Methods). b, c, Measured wave displacement at the 
locations shown in a. Waves were observed year-round at the southern station 
in 2011 (c), but not at the northern station in 2006 (b), when the Kuroshio 
Current deflected the internal wave paths southward 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


Numerical models. Several numerical models were used in this paper, represent- 
ing varying degrees of resolution, domain size and resolved processes. Four three- 
dimensional models with realistic bathymetry and stratification were employed, 
namely a ‘far-field’ model, a ‘near-field’ model, and two ‘Kuroshio’ models. A 
two-dimensional (horizontal-vertical) model was also used. 

The three-dimensional models were used to simulate basin-scale waves, near- 
field physical processes, and the role of the Kuroshio Current, respectively. All 
were forced with predictions using eight tidal constituents (K1, O1, P1, Q1, K2, 
M2, N2 and S2) from a barotropic tidal model, TPXO7.2 (ref. 31), which was 
validated against measured currents in the region in this and a previous experi- 
ment'***. Bathymetry was from multibeam depth soundings where available, and 
from the 30-arcsecond database* elsewhere. Stratification was horizontally uni- 
form in the far-field and near-field models, obtained from the generalized digital 
environmental model database (GDEM) climatology for the month of August** 
for the far-field model and from August 2010 field data for the near-field model. 
Stratification in the Kuroshio models was from larger-scale data-assimilating 
regional simulations. 

The far-field model (Figs 1a and 3a, b) is based on the Hallberg isopycnal 
model and encompasses the entire South China Sea with a spatial resolution 
of 2 km and 40 layers in the vertical’*. The near-field model data (Fig. 2a), with a 
horizontal resolution of 250 m, were generated using a 154-layer implementation 
of the MIT general circulation model (MITgcm; ref. 36), based on the configuration in 
ref. 10. The MITgcm was also used to generate the two-dimensional numerical 
results presented in Fig. 2b and c. For this, the bathymetry was a transect taken 
from the near-field model and calculations were performed using a telescoped grid 
with horizontal and vertical resolutions of 7.8 m and 6.1 m, respectively. None of 
the models explicitly resolve the three-dimensional turbulence that is directly 
responsible for model energy dissipation. 

The two Kuroshio models used are different implementations, but similar in 
their resolution, intent and skill. One (Fig. 4a, vectors) is an application of the NRL 
Ocean Nowcast/Forecast*”™, an integration of the Navy Coastal Ocean Model 
(NCOM) and a statistical data-analysis model. In addition to the tidal forcing, the 
model’s open boundary conditions are from a larger-scale model for the entire east 
Asian seas*°’. The second model, used to generate the ray paths (Fig. 4c, red/blue 
meshes), is a data-assimilating version of the Hybrid Coordinate Ocean Model 
(HYCOM)". 

Shipboard measurements. The basic physical quantities required to characterize 
physical ocean flows are the potential density (density with the compression effects 
of hydrostatic pressure removed) and velocity. The former is measured with 
conductivity, temperature and depth instruments (CTDs) that are repeatedly 
lowered and raised from a shipboard winch. Salinity S is computed from temper- 
ature T, conductivity and pressure P measurements, and density is then a function 
of salinity, temperature and pressure. Velocity is measured with acoustic Doppler 
current profilers (ADCPs). These are affixed to the hull of the ship or lowered with 
the CTD instrument, and velocity is measured as a function of depth beneath the 
ship by the phase shift of 75-kHz, 150-kHz or 300-kHz acoustic pulses backscat- 
tered from the water column. The precision of CTD and ADCP measurements 
depends on set-up and several other factors. Typical uncertainties for density and 
velocity are 2 X 10° *kg m~* and 0.01 m s⁻¹, respectively, which are one to two 
orders of magnitude less than the signals described here. The energy flux measure- 
ments presented in Fig. 2a (flux calculation described next) are computed from 36- 
h time series stations wherein velocity and pressure are repeatedly measured by 
cycling the CTD up and down approximately once each hour. The measurements 
in Fig. 2e are from a specialized ‘fast CTD’ system“* designed to sample much faster 
than a conventional CTD (a profile approximately every ten minutes as opposed to 
about every hour, depending on water depth). 

Glider measurements. Measurements of energy flux were made from autonom- 
ous gliders” as part of IWISE (Fig. 1c). Gliders are autonomous underwater 
vehicles that move up and down through the water by adjusting their buoyancy, 
which is accomplished by filling and draining oil from a bladder. Wings allow the 
glider to ‘fly’ through the water at about 0.25 m s⁻¹. During IWISE, two gliders 
were deployed for about two months each, and sampled density and velocity in the 
upper 500 m every 3 h. 

Moored measurements. Three types of moored measurements were made during 
IWISE: (1) Profiling moorings featuring a McLane Moored Profiler crawling up 
and down a vertical moored wire approximately every 1.5 h between 300 m and 
400 m below the surface and about 10 m above the bottom. Above, a series of 
approximately 30 densely spaced temperature loggers and an ADCP gave tem- 
perature (from which density was computed) and velocity in the upper ocean. The 
moored profilers carried current meters and CTDs, giving continuous, full-water- 
column measurements of density and velocity from a mooring, a challenging task. 


(2) ADCP/T-chain moorings held ADCPs and made temperature and/or salin- 
ity measurements, giving faster sampling at the cost of vertically continuous data. 
Both of these types of moorings were prone to substantial knockdowns by the 
extreme currents in Luzon Strait. Knockdowns were minimized by highly taut 
designs and (for one mooring) a low-drag cylindrical float. When they occurred 
(up to 100 m in the worst case), they were corrected for by means of pressure 
measurements on the top subsurface floats. 

(3) A final moored measurement used bottom-mounted pressure inverted echo 
sounders (PIES), which measure bottom pressure and the round-trip bottom- 
top-bottom travel time of an acoustic pulse transmitted upward every few sec- 
onds. Since sound speed depends on temperature, these signals are proportional to 
the mode-1 displacement of the thermocline“. True mode-1 displacements in 
Fig. 4a and b were computed from travel time using nearby moored in situ 
temperature measurements, and have an overall uncertainty of 4 m (ref. 44). 

Turbulence measurements. The turbulence measurements described on the 
continental shelf were obtained directly using a Rockland Vertical Microstructure 
Profiler. The turbulent dissipation rate ε was estimated by fitting small-scale 
velocity shear observations to a theoretical spectrum and integrating by following 
the method of ref. 45. The turbulent dissipation results in Fig. 2d were obtained 
indirectly using the method of overturns or “Thorpe sorting”**, which computes 
the outer scale of the turbulence via the vertical distance parcels of water have 
moved from a stably stratified profile. Numerous previous studies**** have 
demonstrated good agreement with direct measurements, although recent 
modelling work” has raised the possibility that Thorpe-based estimates could be 
biased high by a factor as great as 2 to 6. 
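
The overturn method can be illustrated with a short sketch. It is not the authors' processing code: it assumes the whole input profile is a single overturning patch, uses the patch-mean sorted density gradient for the buoyancy frequency N, and applies the commonly used relation ε = a² L_T² N³ with a ≈ 0.8 (a value not stated in the text). The function name and default constants are hypothetical.

```python
import numpy as np

def thorpe_dissipation(depth, density, g=9.81, rho0=1025.0, a=0.8):
    """Thorpe scale and dissipation estimate for one overturning patch.

    depth   : 1-D array, positive downward and increasing (m)
    density : 1-D array of potential density at those depths (kg m^-3)
    Returns (L_T in m, N in s^-1, epsilon in W kg^-1).
    """
    depth = np.asarray(depth, dtype=float)
    density = np.asarray(density, dtype=float)

    # Re-order parcels into a statically stable profile (density increasing downward).
    order = np.argsort(density, kind="stable")
    sorted_density = density[order]

    # Thorpe displacement: vertical distance between each parcel's observed
    # depth and its depth in the sorted profile.
    displacement = depth[order] - depth
    thorpe_scale = float(np.sqrt(np.mean(displacement ** 2)))

    # Buoyancy frequency from the mean gradient of the sorted profile.
    n_squared = (g / rho0) * (sorted_density[-1] - sorted_density[0]) / (depth[-1] - depth[0])
    buoyancy_freq = float(np.sqrt(max(n_squared, 0.0)))

    # Assumed scaling: Ozmidov scale = a * Thorpe scale, so epsilon = a^2 L_T^2 N^3.
    epsilon = (a ** 2) * thorpe_scale ** 2 * buoyancy_freq ** 3
    return thorpe_scale, buoyancy_freq, epsilon
```

A stably stratified profile returns a Thorpe scale of zero. Any inversion gives a non-zero scale, and the estimate depends strongly on N and on the assumed constant, which is one reason Thorpe-based values carry the bias caveat noted above.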

Synthetic aperture radar imagery. Although nonlinear internal waves propagate 
in the interior of the ocean, their currents produce convergent (rough) and diver- 
gent (smooth) zones on the ocean surface that move in phase with their subsurface 
crests and troughs. These variations in surface roughness create the distinctive 
light/dark pattern of the internal wave packets found in synthetic aperture radar 
images of the ocean (Fig. 1b), which have a typical horizontal resolution of tens of 
metres. The technique is sensitive to the background roughness of the sea surface, 
which depends on wind speed and other factors. Therefore, imagery is an excellent 
indicator of the location and morphology of the wave fronts, but generally cannot 
give wave amplitude. 

Energy flux. The internal tide energy flux F (presented in Figs 2a and 3a, b) is 
computed from both model and observed data as the depth integral of ⟨u′p′⟩, 
where u′ is the measured velocity fluctuation with the barotropic tide removed and 
p′ is the baroclinic pressure, computed from the density assuming hydrostatic 
balance. 
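
The flux definition above can be sketched as follows. This is an illustration under stated assumptions rather than the authors' code: the barotropic part is approximated by the depth mean of the velocity anomaly, the baroclinic pressure is obtained by hydrostatic integration of the density anomaly from the surface with its depth mean removed, and the angle brackets are taken as a time average over the record. The function and helper names are hypothetical.

```python
import numpy as np

def _integrate(y, depth):
    """Trapezoidal integral over depth (axis 0)."""
    dz = np.diff(depth).reshape((-1,) + (1,) * (y.ndim - 1))
    return np.sum(0.5 * (y[1:] + y[:-1]) * dz, axis=0)

def _depth_mean(y, depth):
    return _integrate(y, depth) / (depth[-1] - depth[0])

def internal_tide_flux(u, rho, depth, g=9.81):
    """Depth-integrated <u'p'> in W per metre of wave crest.

    u, rho : arrays of shape (n_depth, n_time), velocity (m s^-1) and density (kg m^-3)
    depth  : array of shape (n_depth,), positive downward, full water column (m)
    """
    # Velocity fluctuation: remove the time mean, then the depth-mean (barotropic) part.
    u_anom = u - u.mean(axis=1, keepdims=True)
    u_bc = u_anom - _depth_mean(u_anom, depth)

    # Hydrostatic pressure anomaly from the density anomaly, integrated down from the surface.
    rho_anom = rho - rho.mean(axis=1, keepdims=True)
    layers = 0.5 * (rho_anom[1:] + rho_anom[:-1]) * np.diff(depth)[:, None]
    p_hyd = g * np.vstack([np.zeros((1, rho.shape[1])), np.cumsum(layers, axis=0)])

    # Baroclinic pressure has zero depth mean.
    p_bc = p_hyd - _depth_mean(p_hyd, depth)

    # Time-average the product at each depth, then integrate over depth.
    flux_profile = (u_bc * p_bc).mean(axis=1)
    return float(_integrate(flux_profile, depth))
```

Because the pressure requires a vertical integral from the surface, the sketch only makes sense for full-water-column input.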

Energy flux is computed separately for the semidiurnal and diurnal motions by 
use of harmonic analysis. Shipboard time-series stations are always conducted for 
= 36 h, allowing separation of these motions from each other and from the inertial 
frequency, which corresponds to a period of 33.4 h at this latitude. For each 36-h station or 3-day time 
period in the case of moored and glider data, least-squares fits are done at each 
depth to semidiurnal, diurnal and inertial motions. Much longer time series (~14 
days) would be required to separate the different tidal constituents within each 
band (for example, M2/S2 and K1/O1), so the diurnal and semidiurnal motions 
are referred to as D1 and D2, respectively. 
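
The band separation described above can be sketched with an ordinary least-squares harmonic fit. The sketch makes several assumptions that are not in the text: a latitude of 21° N for the inertial period, the K1 and M2 periods (23.93 h and 12.42 h) as representatives of the D1 and D2 bands, and a single time series at one depth. The function names are hypothetical.

```python
import numpy as np

SIDEREAL_DAY_H = 23.9345  # hours

def inertial_period_hours(lat_deg):
    """Inertial period 2*pi/f, with f = 2*Omega*sin(latitude)."""
    return 0.5 * SIDEREAL_DAY_H / np.sin(np.radians(abs(lat_deg)))

def harmonic_fit(t_hours, y, periods_hours):
    """Least-squares fit of a mean plus one cosine/sine pair per period.

    Returns a dict mapping each period to its fitted amplitude.
    """
    t_hours = np.asarray(t_hours, dtype=float)
    columns = [np.ones_like(t_hours)]
    for period in periods_hours:
        omega = 2.0 * np.pi / period
        columns += [np.cos(omega * t_hours), np.sin(omega * t_hours)]
    design = np.column_stack(columns)
    coef, *_ = np.linalg.lstsq(design, np.asarray(y, dtype=float), rcond=None)
    return {
        period: float(np.hypot(coef[1 + 2 * i], coef[2 + 2 * i]))
        for i, period in enumerate(periods_hours)
    }

# Assumed band periods: D2 (M2), D1 (K1) and the inertial period near 21 N.
periods = (12.42, 23.93, float(inertial_period_hours(21.0)))
```

At 21° N the inertial period evaluates to about 33.4 h, matching the value in the text. A 36-h record gives only marginal frequency resolution between the diurnal and inertial bands, so a fit of this kind is sensitive to noise even though it is exact for noise-free input.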

Nonlinear coupling between different frequencies would complicate our 
method of separation into bands. To ensure that nonlinear terms are not import- 
ant in this context, the sum of the separated fluxes is compared to the total flux 
before separation. The two agree to within 10%'*!7*". 

Because it requires a vertical integral, the calculation of p′ relies on 
full-water-column data. Therefore, gaps in the water column measurements give rise to 
errors, particularly when they are near the surface where the flux is the greatest. 
For the model data and CTD/ADCP station measurements (Fig. 2a) gaps and 
associated errors are negligible. For the moored measurements where gaps are only 
tens of metres out of thousands, the associated errors are about 20%°*, which are 
determined by sampling the full-water-column model output with the coverage of 
the moorings. For the glider measurements, which sample only the upper 500 m, 
calculation of energy flux relies on fitting the data to the first baroclinic mode. 
Although the moored data verify that the bulk of the energy is in this mode, glider 
uncertainty in total flux is about 30%”. 

To account for modulation by the spring/neap cycle, flux values from 36-h 
stations are converted to time-mean values by scaling them with the square of 
the barotropic velocity during the time of observations’*. Values from moorings, 
which directly measure this modulation, are simply the time-mean for each con- 
stituent. 
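
One way to read the spring/neap correction is as a rescaling by the ratio of the long-term mean squared barotropic velocity to its mean square during the station. The sketch below implements that reading; the exact normalization used by the authors is not given in the text, so the formula, the function name and its arguments are assumptions.

```python
import numpy as np

def time_mean_flux(flux_obs, u_bt, t_hours, t_start, t_end):
    """Scale a station flux to a time-mean value.

    flux_obs : flux measured during the station (any flux unit)
    u_bt     : long barotropic velocity record, e.g. from a tidal model (m s^-1)
    t_hours  : times of u_bt (h)
    t_start, t_end : start and end of the station occupation (h)
    """
    u_bt = np.asarray(u_bt, dtype=float)
    t_hours = np.asarray(t_hours, dtype=float)
    in_station = (t_hours >= t_start) & (t_hours <= t_end)
    # Assumed scaling: flux is proportional to the squared barotropic velocity.
    return flux_obs * np.mean(u_bt ** 2) / np.mean(u_bt[in_station] ** 2)
```

Under this assumption a station occupied at spring tide is scaled down and one occupied at neap tide is scaled up.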

The energy flux of nonlinear waves (presented in Fig. 3c) has two extra terms in 
addition to the linear term computed above for the internal tides. An accurate 
expression for the energy flux of nonlinear waves is cE, where c is the wave speed 
and E is the sum of their kinetic and available potential energy”. The 
nonlinear waves’ short timescales (minutes as opposed to hours for the internal 
tides) enable them to be easily isolated from the internal tides by means of 
bandpass filtering. 

Energy budget calculation. Although in general internal wave energy arises from 
a combination of wind and tidal forcing, in Luzon Strait the dominant energy 
source is the barotropic tide. Conversion from barotropic tidal motions to 
baroclinic energy can be quantified as C = U_BT · ∇H p_bot, where U_BT is the barotropic 
tidal velocity, H is the seafloor depth and p_bot is the pressure at the seafloor. 
Though observations can be used to compute conversion”, they are far too sparse 
to estimate the integral over the near-field region, so we use the model (uncer- 
tainties presented below). 

In steady state, an energy budget for the internal tides (which here we consider 
to be comprised of only the nearly sinusoidal waves) can be written as 


C − ∇·F = D    (1) 


where D represents all processes removing energy from the internal tide, including 
dissipation and transfer of energy to the sharp nonlinear waves seen to the west. In 
the near-field region where no nonlinear waves are yet present, D represents 
primarily turbulent processes. In the west, reductions in internal tide flux also 
arise as nonlinear wave fluxes increase (Fig. 3c). 

None of the models resolve turbulence. However, the two- and three-dimen- 
sional near-field models resolve the processes that lead to it, primarily breaking 
internal waves, as shown in Fig. 2. Dissipation in these models is computed from a 
closure scheme” similar to the above Thorpe sorting algorithm used in the mea- 
surements. Observed and modelled dissipation have been demonstrated to be 
within a factor of two of one another in breaking lee waves observed at several 
sites in the Luzon Strait'’ and one at the Hawaiian Ridge’’, but a detailed com- 
parison at Luzon Strait is not yet complete. 

The integrated conversion and fluxes discussed in the main text are computed 
by integrating the far-field model (Fig. 3a, b) over the domain (19°-21.5° N, 120°- 
122.5° E). Flux and dissipation values plotted in Fig. 3c are all from measurements 
near 21° N***°. Measurements along a line rather than integrated values are 
subject to reduction by cylindrical spreading. As is evident in Fig. 3a and b, the 
model shows a factor-of-two reduction in fluxes along 21° N for D1 and a <20% 
reduction for D2, owing to differences in their generation geometry. 

Measurements at 120° E are from gliders, which agree well with the far-field 
model”. Internal tide and nonlinear internal wave flux estimates further to the 
west are from moorings. Turbulence estimates on the continental slope and upper 
shelf are from Thorpe scales, while those west of 117.25° E are directly measured 
with the Vertical Microstructure Profiler. The separate eastward and westward 
fluxes from the mooring at 117.895° E are estimated by assuming the continental 
slope is a vertical wall (a good assumption for the diurnal motions); the separate 
fluxes are then estimated from the ratio of the total energy to the flux with 
knowledge of the distance between the mooring and the wall. 

Integrated dissipation on the slope and shelf is computed from the available 
observations by simply integrating with respect to depth and multiplying by the 
cross-slope distances indicated in Fig. 3c, which assumes that the observed 
locations are representative. Because the validity of this assumption is not known, the 
estimates are uncertain by at least a factor of two. 
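
The integration described above amounts to a depth integral of the dissipation rate multiplied by a cross-slope distance. The sketch below shows the arithmetic only; the reference density, the function name and the numbers in the usage note are illustrative placeholders, not measurements from the paper.

```python
import numpy as np

def integrated_dissipation(depth, epsilon, cross_slope_distance, rho0=1025.0):
    """Dissipation per metre of coastline (W m^-1).

    depth    : 1-D array of sample depths, increasing (m)
    epsilon  : dissipation rate at those depths (W kg^-1)
    cross_slope_distance : distance over which the profile is taken as representative (m)
    rho0     : assumed reference density (kg m^-3)
    """
    depth = np.asarray(depth, dtype=float)
    epsilon = np.asarray(epsilon, dtype=float)
    # Depth integral of rho0 * epsilon gives W m^-2 (trapezoidal rule).
    column = rho0 * np.sum(0.5 * (epsilon[1:] + epsilon[:-1]) * np.diff(depth))
    return float(column * cross_slope_distance)
```

For example, a placeholder profile with a uniform rate of 1e-7 W kg⁻¹ over 100 m of water, taken as representative of 20 km of shelf, gives about 205 W per metre of coastline.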

Model uncertainty. The skill of the models is evaluated by first comparing energy 
flux estimates point-by-point with observations, and then comparing integrated 
conversion and energy flux between the different models. Far-field model and 
observed flux magnitude are generally within a factor of two of each other, with 
flux direction agreeing to about 30° (Extended Data Fig. 1). Modelled D2 fluxes are 
about twice the observed values at the northern stations (N1, N2, LS02; see ref. 15 
for station locations), suggesting the possibility of a high bias in the model in that 
region. However, glider and far-field model values just west of these stations are in 
agreement, suggesting instead that the poorer agreement at these stations may be 
due to the extremely steep slopes at those locations. 

Additional uncertainty arises from the energy difference between the models, 
which is also approximately a factor of two (Extended Data Table 1). Specifically, 
conversion and radiated flux in the near-field model exceed those in the 
far-field and Kuroshio models by nearly a factor of two, probably because of the 
near-field model’s higher resolution. All three models have very similar spatial 
patterns, consistent with those inferred from observations (Fig. 2a; refs 15, 17). 
Taking into account the factor-of-two scatter between the observations and the 
far-field model, and this factor-of-two energy difference between the different 
models, we choose to use the far-field model in the energy budget calculation, 
and estimate an overall factor-of-two uncertainty in all model flux and conversion 
estimates. 

Data. Source data for each figure are provided as spreadsheets online. 
Extended Data Figure 1 | Comparison of observed and model energy flux. 
Left, Scatter plot of flux magnitude F from observations (x axis) and far-field 
(UAF; University of Alaska Fairbanks) model (y axis). Error bars are ±20% for 
observed values and ±10% for model values (see Methods). Right, As for the 
left panel, but for direction θ; error bars are ±30°. See source data and ref. 15 for 
station locations. Black, semi-diurnal; red, diurnal. 
Extended Data Table 1 | Conversion and radiated flux integrated over the region 19° to 21.5° N, 120° to 122.5° E. 
A bizarre Jurassic maniraptoran theropod with 
preserved evidence of membranous wings 


Xing Xu, Xiaoting Zheng, Corwin Sullivan, Xiaoli Wang, Lida Xing, Yan Wang, Xiaomei Zhang, Jingmai K. O’Connor, 
Fucheng Zhang & Yanhong Pan 


The wings of birds and their closest theropod relatives share a 
uniform fundamental architecture, with pinnate flight feathers 
as the key component’ *. Here we report a new scansoriopterygid 
theropod, Yi qi gen. et sp. nov., based on a new specimen from the 
Middle-Upper Jurassic period Tiaojishan Formation of Hebei 
Province, China‘. Yi is nested phylogenetically among winged theropods 
but has large stiff filamentous feathers of an unusual type on 
both the forelimb and hindlimb. However, the filamentous feath- 
ers of Yi resemble pinnate feathers in bearing morphologically 
diverse melanosomes’. Most surprisingly, Yi has a long rod-like 
bone extending from each wrist, and patches of membranous tissue 
preserved between the rod-like bones and the manual digits. 
Analogous features are unknown in any dinosaur but occur in 
various flying and gliding tetrapods*”, suggesting the intriguing 
possibility that Yi had membranous aerodynamic surfaces totally 
different from the archetypal feathered wings of birds and their 
closest relatives. Documentation of the unique forelimbs of Yi 
greatly increases the morphological disparity known to exist 
among dinosaurs, and highlights the extraordinary breadth and 
richness of the evolutionary experimentation that took place close 
to the origin of birds. 


Theropoda Marsh, 1881 
Maniraptora Gauthier, 1986 
Scansoriopterygidae Czerkas et Yuan, 2002 
Yi qi gen. et sp. nov. 


Etymology. The generic and specific names are derived from 
Mandarin Yi (wing) and qi (strange), respectively, referring to the 
bizarre wings of this animal. The intended pronunciation of the name 
is roughly “ee chee”. 

Holotype. STM 31-2 (housed at the Shandong Tianyu Museum of 
Nature), an articulated partial skeleton with associated soft tissue pre- 
served on a slab and counter slab. The specimen was collected by a 
local farmer, but its provenance and authenticity have been confirmed 
by multiple lines of evidence including sedimentology, taphonomy 
and computed tomography (CT) data (Extended Data Figs 1-4; see 
also Supplementary Information). 

Locality and horizon. Mutoudeng, Qinglong County, Hebei Province, 
China. Tiaojishan Formation, Callovian-Oxfordian stage*”’. On the 
basis of the provenance of the specimen, Yi qi is a member of the 
Daohugou (or Yanliao) Biota’. 

Diagnosis. A scansoriopterygid theropod distinguishable from 
other scansoriopterygids in having a low midline crest along the 
nasals, a relatively small and posteriorly located external mandibular 
fenestra, tooth crowns that are symmetrical in lateral view and con- 
siderably wider mesiodistally than their corresponding roots, a 
humerus and ulna that are long relative to the tibiotarsus (length 


ratios are 1.16 and 1.08, respectively, compared to 0.96 and 0.78 in 
Epidendrosaurus and 0.79 and 0.66 in Epidexipteryx), an extremely 
short humeral deltopectoral crest, and a long rod-like bone articu- 
lating with the wrist. 

Key osteological features are as follows. STM 31-2 (Fig. 1) is inferred 
to be an adult on the basis of the closed neurocentral sutures of the 
visible vertebrae, although this is not a universal criterion for maturity 
across archosaurian taxa’. Its body mass is estimated to be approxi- 
mately 380 g, using an empirical equation”. 

The skull and mandible are similar to those of other scansoriopter- 
ygids, and to a lesser degree to those of oviraptorosaurs and some basal 
birds such as Sapeornis and Jeholornis''*"’: the skull is relatively 
robust, the snout is short (approximately 40% of skull length), the large 
premaxilla bears a well-developed subnarial process and extends fur- 
ther ventrally than the small maxilla, the parietal is anteroposteriorly 
long, the antorbital fossa is much smaller than the orbit, the infratem- 
poral fenestra is large, the anterior end of the robust mandible is 
downturned, four slightly procumbent premaxillary teeth are located 
anterior to the external naris, the first premaxillary tooth is larger than 
the more posterior ones, and the anterior dentary teeth are strongly 
procumbent (Fig. 1a-d). The thoracic appendage resembles that of 
other paravians in that the scapula is proportionally short and the 
humerus is long and robust, and further resembles that of other scan- 
soriopterygids'*° in having a robust scapular blade, a short humeral 
deltopectoral crest, a straight ulna that is only slightly more robust than 
the radius, a metacarpal IV that is longer and more robust than meta- 
carpal III (we identify the three manual digits of Yi and other manir- 
aptorans as II-III-IV, following the position-based numbering used in 
most ornithological literature, although we acknowledge that an 
anatomy-based I-II-III numbering convention is preferred by most dinosaur 
workers), and non-ungual phalanges of digit IV that are all greatly 
elongated (Fig. 1a, b, e). 

The most striking feature of Yi is the presence of an anomalous, 
slightly curved, distally tapered, rod-like structure (see Supplementary 
Information for further description), whose length considerably 
exceeds that of the ulna, associated with each wrist and apparently 
extending from the ulnar side of the carpus (Fig. 1a, b, f and Extended 
Data Fig. 5). These structures resemble the preserved bones on the slab 
in colour, and in displaying a granular texture under magnification. 
Energy dispersive spectrometry (EDS) analysis of the elemental com- 
position of the rod extending from the right wrist confirms that it is a 
bone (or possibly a skeletal element composed of calcified cartilage) 
and not a soft-tissue structure (Extended Data Fig. 6; see also 
Supplementary Information). However, the rod-like bone of the fore- 
limb of Yi is morphologically unlike any normal theropod skeletal 
element. Indeed, no equivalent of the rod-like bone is known in any 
other dinosaur even outside Theropoda, but similar structures are 
present in a diverse array of extant and extinct flying or gliding 
tetrapod groups*”°. For example, a rod of bone or cartilage is attached 
to the wrists of petauristine squirrels'’; the elbows of anomalurid 
squirrels’, the gliding marsupial Petauroides volans'? and the 
Oligocene epoch eomyid rodent Eomys quercyi; the ankles of many 
bats, including some early ones”’°; and the wrists of all pterosaurs® 
(that is, the pteroid). In particular, the rod-like bone of Yi is strikingly 
similar to the enlarged carpally situated element seen in some 
petauristines’, including the Japanese giant flying squirrel Petaurista 
leucogenys (Extended Data Fig. 7; see also Supplementary Information). For 
convenience, we adopt ‘styliform element’ as a general term for 
unjointed, rod-like bony or cartilaginous structures that extend from 
distal limb joints in tetrapods. 

Figure 1 | Yi qi holotype (STM 31-2). a, b, Photograph (a) and line drawing 
(b) of specimen; c, skull and mandible in lateral view; d, premaxillary tooth in 
lateral view; e, left manus; f, styliform elements (the distally unexposed left 
styliform element articulates with the wrist, and the orientation of the right 
styliform element implies a similar relationship to the carpus even though its 
proximal part is missing). Light and dark grey shading indicates feathers and 
membranous tissues, respectively. an, angular; cv, cervical vertebrae; d, dentary; 
dr, dorsal ribs; emf, external mandibular fenestra; en, external naris; f, frontal; lf, 
left femur; lh, left humerus; lmd2-4, left manual digits 2-4; lmt, left metatarsals; 
lr, left radius; ls, left scapula; lse, left styliform element; lu, left ulna; mb, 
mandible; mcII-IV, metacarpals II-IV; n, nasal; or, orbit; p, parietal; phII1 to 
IV4, phalanges II-1 to IV-4; pma, premaxilla; rmd2-4, right manual digits 2-4; 
rf, right femur; rfi, right fibula; rh, right humerus; rmc, right metacarpals; rmt, 
right metatarsals; rr, right radius; rse, right styliform element; rt, right 
tibiotarsus; ru, right ulna; sk, skull. Scale bar, 2 cm. 

Integumentary structures and preserved melanosomes are as fol- 
lows. Two major types of integumentary structure are preserved (see 
Supplementary Information for further description): feathers and 
membranous soft tissue (Figs 1a, b, 2 and Extended Data Fig. 5). 
Thin stiff filamentous feathers are distributed around the skull (15-20 mm 
long; Fig. 2a), and both above and below the neck (about 
30 mm long; Extended Data Fig. 5a). The feathers attached to the 
forelimb (35-60 mm long; Fig. 2b and Extended Data Fig. 5b, c), 
and to the hindlimb including the metatarsus (up to 60 mm long; 
Fig. 2c and Extended Data Fig. 5e), are much larger than the skull 
and neck feathers. Some of the filamentous feathers, including those 
on the skull and neck, appear to exhibit a simple branching pattern, but 
their dense preservation makes morphological details difficult to 
observe. Nevertheless, some isolated feathers clearly comprise multiple 
radiating filaments (Fig. 2d and Extended Data Fig. 5d), and the major- 
ity of the limb feathers have a unique paintbrush-like morphology 
(Fig. 2e): the proximal three-fourths of each feather is a wide (about 
1.2 mm), undifferentiated shaft-like structure, whereas the distal part 
is composed of numerous near-parallel filaments. 

Several patches of membranous soft tissue are exposed around the 
styliform element and digits in both hands (Figs 1a, b, 2f and Extended 
Data Fig. 5f, g). Additional patches appear to exist, but cannot be easily 
exposed (see Supplementary Information). The tissue has a sheet-like 
appearance, making it clearly distinguishable from the individual fila- 
ments or small groups of filaments that represent feathers (Fig. 2g), 
and in some areas displays prominent ripple-like striations that may 
represent either fibres or closely spaced folds. The membranous tissue 
does not closely resemble any type of integumentary structure prev- 
iously reported in theropods from the Mesozoic era deposits of north- 
eastern China. 

We examined the integumentary structures using scanning electron 
microscopy (Extended Data Fig. 8), revealing what we interpret as 
preserved melanosomes on both feathers and membranous tissue 
patches (although several studies have questioned the identification 
of melanosomes in theropod fossils”*”’). The melanosomes are highly 
variable in size (long axis 300-2,100 nm), shape (aspect ratio of 1.0 to 
3.6) and density (Extended Data Fig. 9). It is noteworthy that some of 
the preserved melanosomes are among the largest known from either 
fossil or modern feathers’. The membranous tissue contains only small 
phaeomelanosomes (long axis 300-400 nm), unlike the filamentous 
feathers in which eumelanosomes predominate and relatively few 
phaeomelanosomes are present (see Supplementary Information for 
details). 

A numerical phylogenetic analysis places Yi within the enigmatic 
Scansoriopterygidae family'*"’’”, which in turn is posited as the sister 
taxon to other paravians (Fig. 3 and Extended Data Fig. 10). 
Scansoriopterygids including Yi have numerous unusual skeletal fea- 
tures, including a highly elongated third finger and a highly modified 
pelvis'*’*’”. They also display unusual integumentary features: despite 
being basal paravians, they lack pinnate feathers, but their filamentous 
feathers resemble the pinnate feathers of other pennaraptorans in 
having morphologically diverse melanosomes’. Given that scansoriop- 
terygids are phylogenetically nested within Pennaraptora, they 
appear to represent a case of extreme morphological divergence near 
the origin of birds, involving not only extensive modification of the 
skeleton but also secondary loss of normal pennaraptoran pinnate 
feathers. 

Large flight feathers and even propatagia are present in the basal 
oviraptorosaur Caudipteryx and the basal deinonychosaurs Anchiornis 
and Microraptor'* (and a propatagium has been identified in the 
scansoriopterygid Epidendrosaurus”), indicating that the feathered 
wing is a primitive feature of the Pennaraptora even though basal 
members of this group are flightless’. Both phylogenetic bracketing” 
[Figure 3 image: cladogram of coelurosaurian theropods with terminal taxa (including Epidendrosaurus ningchengensis, Epidexipteryx hui and Yi qi) and the clade labels Paraves, Avialae and Deinonychosauria.]

Figure 3 | Simplified coelurosaurian phylogeny showing the recovered 
position of Yi. The skeletal silhouette and two possible alternative planform 
reconstructions of Yi highlight the proportionally long and robust forelimbs 
and large leg feathers that Yi shares with other basal paravian theropods, 
indicating the presence of aerial capability, and the inferred membranous 
wings, a feature unique among known paravians but seen in most other gliding 
or flying tetrapods. Various uncertainties, such as how the styliform element is 
oriented and whether membranous tissue is present lateral to the trunk as in 
most volant tetrapods, imply that a variety of reconstructions of the 
aerodynamic apparatus of Yi are currently plausible (see Supplementary 
Information for additional possible reconstructions). 
Figure 2 | Soft tissues preserved in Yi qi holotype 
(STM 31-2). a-e, Feathers above the skull 
(a), along the humerus (b) and along the tibiotarsus 
(c); isolated basally converging (d) and brush-like 
(e) feathers; f, sheet-like soft tissue associated with 
the right forelimb (yellow arrows point to patches 
of sheet-like soft tissue); g, comparison of sheet-like 
soft tissue (above digit II) and individual feathers 
(below digit II). 


and the preserved morphology of the shoulder girdle and forelimb 
suggest that Yi resembles other basal paravians in the overall skeletal 
structure of its thoracic girdle and appendages. In Yi, however, the 
large flight feathers present in other pennaraptorans appear to be at 
least partly replaced by membranes supported by the styliform element 
and manual digits, in stark contrast to the archetypal wings that are 
universal in birds and their closest relatives. The highly elongated 
manual digit IV of Yi and other scansoriopterygids is unique among 
theropods but superficially similar to the long manual digits II-V of 
bats and the highly elongated fourth finger in pterosaurs. Furthermore, 
some major components of the wing of Yi, such as the membrane and 
styliform element, are not known in other winged theropods. The 
discovery of Yi thus adds considerably to the known diversity of thor- 
acic appendage morphologies present near the transition to birds. 

Because other amniotes that possess a styliform element invariably 
utilize this structure to support an aerodynamic membrane that con- 
tributes to gliding or powered flight®, and alternative functions for a 
long, unjointed rod of bone or cartilage extending from a distal limb 
joint are difficult to imagine (see Supplementary Information), the 
occurrence of a styliform element in Yi is a strong indication that 
membranous aerodynamic surfaces and some degree of aerial capabil- 
ity were present in this taxon. Further evidence that Yi was volant 
comes from the preserved patches of membranous tissue associated 
with the styliform element and digits, and from the proportions of the 
appendicular skeleton (see Supplementary Information). It is worth 
emphasizing that, even if our inference that Yi was an aerially adapted 
taxon with membranous wings proves incorrect, the styliform element 
and membranous tissue patches will stand as highly unusual features 
that imply some kind of equally distinctive forelimb function in this 
peculiar theropod. 

Even under our preferred interpretation, the flight apparatus of Yi 
cannot be confidently reconstructed due to the incomplete preser- 
vation of the wing membrane in the only known specimen, combined 
with uncertainties regarding the styliform element’s orientation and 
connection to the wrist. However, the range of possible flight apparatus 
configurations can be explored by considering different reconstruc- 
tions (Fig. 3 and Extended Data Fig. 7) that are supported to varying 
degrees by morphological and taphonomic information from the spe- 
cimen, fundamental aerodynamic principles, and the aerial loco- 
motion of other volant tetrapods. Preliminary analysis of the wing 
loadings and other properties of some of these configurations supports 
the plausibility of Yi as a volant taxon (see Supplementary 
Information). In having wings with a well-developed membranous 
component, Yi would differ from other volant paravians but resemble 
distantly related groups including pterosaurs, bats and many gliding 
mammals, representing a striking case of convergent evolution of the 
aerodynamic apparatus among tetrapods*””. 
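
Wing loading, referred to in the preceding paragraph, is conventionally defined as body weight divided by wing area. The short Python sketch below only illustrates that definition. The function name and the input values are assumptions made for this example; they are not measurements or estimates for Yi, and the authors' actual calculations are in their Supplementary Information.

```python
G = 9.81  # standard gravitational acceleration, m/s^2


def wing_loading(mass_kg, wing_area_m2):
    """Return wing loading in N/m^2 (body weight divided by wing area)."""
    if wing_area_m2 <= 0:
        raise ValueError("wing area must be positive")
    return mass_kg * G / wing_area_m2


# Hypothetical inputs for illustration only; not estimates for Yi.
example = wing_loading(0.4, 0.1)
```

Comparing values of this kind across alternative reconstructions is one simple way to explore how the inferred membrane area would affect plausibility as a volant animal.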

The mode of aerial locomotion that might have been used by Yi is 
difficult to reconstruct on the basis of present evidence. Yi may have 
been capable of flapping flight or only of gliding, or may have com- 
bined the two locomotor styles as in many extant birds and some bats 
(see Supplementary Information). There are some indications that Yi 
may have relied more on gliding than on flapping, including the lack of 
strongly expanded muscle attachment surfaces on the forelimb bones 
and the possibility that the unwieldy styliform element would have 
interfered with the rapid oscillations and rotations of the distal part of 
the forelimb needed for efficient flapping flight, but the mode of aerial 
locomotion that is most likely for Yi remains uncertain. Regardless, the 
evident occurrence in this taxon of a membranous wing supported by a 
styliform element represents an unexpected aerodynamic innovation 
close to the origin of birds, and highlights the breadth of flight-related 
morphological experimentation that took place in the early stages of 
paravian history. 
Extended Data Figure 1 | Fossil-bearing beds of the Middle-Upper Jurassic Tiaojishan Formation in Qinglong County, Hebei. The Mutoudeng Locality (top) and Gangou Locality (bottom) are geographically close to each other (approximately 20 km apart) and very similar in their sedimentology and fossil content. The bottom image shows an excavation we organized at the Gangou Locality. 
Extended Data Figure 2 | Photograph of the slab and counter slab of STM 31-2. 
Extended Data Figure 3 | Close-ups of the three counter slab fragments and corresponding slab regions of STM 31-2. Top row: isolated piece from counter slab preserving cranial region (left), and corresponding region of slab (right); middle row: isolated piece from counter slab preserving forelimb region (left), and corresponding region of slab (right); bottom row: isolated piece from counter slab preserving hindlimb region (left), and corresponding region of slab (right). 
Extended Data Figure 4 | Volumetric model generated from superimposed CT slices. Because there is little density contrast between the fossil and the matrix, 
the outline of the fossil is somewhat vague. 
Extended Data Figure 5 | Photographs of preserved integumentary features of STM 31-2. a-c, Feathers over the neck (a), along the humerus (b) and along the humerus and ulna (c); d, isolated basally converging feathers; e, sheet-like soft tissue and feathers associated with the right forelimb and feathers along the right hindlimb; f, g, close-ups of sheet-like soft tissue associated with the right forelimb (f) and distal phalanges of manual digit IV (g). 
Extended Data Figure 6 | Elemental compositions of the styliform element and comparison samples based on EDS analyses. EDS spectra derived from: 
a, right manual phalanx II-1; b, right styliform element; c, feathers associated with right tibia; and d, sedimentary matrix. 
Extended Data Figure 7 | Aerodynamic apparatuses in Yi and other tetrapods. a-c, Wing of Yi, Bat Model (a), Maniraptoran Model (b), and Frog Model 
(c); d, bat wing; e, pigeon wing; f, pterosaur wing; g, giant Japanese flying squirrel wing. Yellow colour indicates styliform element in a, b, c, d, f, and g. 
Extended Data Figure 8 | Scanning electron microscopy sample locations. Numbered circles indicate the locations of samples from the slab and counter slab. 1, distal part of cranial feathers; 2, middle part of tibial feathers; 3, distal part of humeral feathers; 4, distal part of ulnar feathers; 5, proximal part of cranial feathers; 6, proximal part of neck feathers; 7, distal part of humeral feathers; 8, distal part of humeral feathers; 9 and 10, membranous soft tissue near digit II; 11, membranous soft tissue near digit IV; 12, middle part of tibial feathers. 
Extended Data Figure 9 | Scanning electron microscopy images of melanosomes and melanosome impressions preserved in Yi qi holotype (STM 31-2). a-m, Note the high morphological diversity of the preserved melanosomes in the feathers, which include: densely distributed, small round phaeomelanosomes and sparsely distributed, medium-sized oval eumelanosomes in the feathers near the skull (a); medium-sized elongate elliptical eumelanosomes in the feathers near the skull (b, c), neck (d, e), humerus (f, g), and ulna (h, i, j); and large oval and elliptical eumelanosomes in the feathers near the tibiotarsus (k, l). The subspherical phaeomelanosomes in the sheet-like soft tissue (m) appear to be less densely distributed than the melanosomes in the feathers. 
Bees prefer foods containing neonicotinoid pesticides 

Sébastien C. Kessler, Erin Jo Tiedeken, Kerry L. Simcock, Sophie Derveau, Jessica Mitchell, Samantha Softley, Jane C. Stout & Geraldine A. Wright 

The impact of neonicotinoid insecticides on insect pollinators is 
highly controversial. Sublethal concentrations alter the behaviour 
of social bees and reduce survival of entire colonies’. However, 
critics argue that the reported negative effects only arise from 
neonicotinoid concentrations that are greater than those found 
in the nectar and pollen of pesticide-treated plants*. Further- 
more, it has been suggested that bees could choose to forage on 
other available flowers and hence avoid or dilute exposure*’. Here, 
using a two-choice feeding assay, we show that the honeybee, Apis 
mellifera, and the buff-tailed bumblebee, Bombus terrestris, do not 
avoid nectar-relevant concentrations of three of the most com- 
monly used neonicotinoids, imidacloprid (IMD), thiamethoxam 
(TMX), and clothianidin (CLO), in food. Moreover, bees of both 
species prefer to eat more of sucrose solutions laced with IMD or 
TMX than sucrose alone. Stimulation with IMD, TMX and CLO 
neither elicited spiking responses from gustatory neurons in the 
bees’ mouthparts, nor inhibited the responses of sucrose-sensitive 
neurons. Our data indicate that bees cannot taste neonicotinoids 
and are not repelled by them. Instead, bees preferred solutions 
containing IMD or TMX, even though the consumption of these 
pesticides caused them to eat less food overall. This work shows 
that bees cannot control their exposure to neonicotinoids in food 
and implies that treating flowering crops with IMD and TMX 
presents a sizeable hazard to foraging bees. 

Determining the impacts of pesticides on pollinators is important 
for the future of world food security. Pollinating insects 
like bees increase the yields of human crops, but in doing so, are 
inadvertently exposed to pesticides in floral nectar and pollen®”’. 
Several studies have concluded that bees exposed to sublethal doses 
of neonicotinoid pesticides in food have difficulty learning floral traits, 
feeding, navigating and foraging’’*""’, and have impaired motor func- 
tion’. These changes in behaviour often lead to colony failure’’. 
This body of work has galvanized public concern over bee welfare, 
and in 2013, led to a two-year ban on the use of the three most 
common neonicotinoids (IMD, TMX, CLO) on flowering crops by 
the European Union. The agricultural importance of these pesticides 
has motivated agrochemical producers and government scientists to 
challenge this ban. Critics of laboratory-based experiments contend 
that such studies use food laced with neonicotinoid concentrations 
that exceed the levels found in nectar and pollen”, or give bees no 
choice of food solutions**. They propose that free-living bees and other 
insect pollinators could choose to avoid the nectar and pollen of pes- 
ticide-treated crops* if pollinators are repelled by neonicotinoids’*”, 
and if alternative sources were provided such as field margins in agri- 
cultural settings. 

These arguments require that pollinators are able to detect neoni- 
cotinoids in food in order to avoid exposure. We tested whether bees 
avoid sucrose solutions (that is, nectar) containing neonicotinoids 
using a two-choice test designed to identify the bumblebee’s gustatory 


detection thresholds for nectar toxins'®. Individual foraging-age 
worker bumblebees or cohorts of 25 forager honeybees were housed 
in plastic boxes for 24 h and given access to two types of food tubes: one 
containing sucrose solution and one containing sucrose solution laced 
with a specific concentration of IMD, TMX or CLO. The concen- 
trations used included values in the range reported from nectar and 
pollen (0.5-150 nM, Extended Data Table 1). Neither bumblebees nor 
honeybees avoided concentrations found within the naturally occur- 
ring range (Fig. 1a, b), even though high concentrations of TMX and 
CLO reduced their survival (Extended Data Fig. 1). We also tested 
whether these pesticides inhibited the honeybee’s feeding reflex (pro- 
boscis extension) or caused honeybees to retract the proboscis once 
extended’”. None of the sucrose solutions containing IMD, TMX or 
CLO affected proboscis extension or retraction (Extended Data Fig. 2). 

Unexpectedly, we observed that both bumblebees and honeybees 
showed a preference for solutions containing IMD or TMX over suc- 
rose alone (Fig. 1, Extended Data Tables 2, 3). Concentrations of IMD 
and TMX proximate to those found in nectar (1-10nM, Extended 
Data Table 1) were most attractive to bumblebees (Fig. 1a), whereas 
honeybees preferred to consume IMD and TMX across a broader range 
of concentrations (Fig. 1b). The ‘attractive’ effect of IMD also depended 
on bee age: newly emerged adult worker bumblebees and honeybees 
largely avoided 1-10 nM IMD (Extended Data Fig. 3a). In addition, the 
presence of neonicotinoids influenced the total amount of food con- 
sumed from both tubes during 24h (Fig. 1c, d). Bumblebees fed with 
IMD or CLO consumed less total food on average than those fed TMX 
or the sucrose control (Fig. 1c, Extended Data Table 2); this effect has 
also been observed by others’"”’. In contrast, the total food consump- 
tion of forager honeybees was reduced only when bees fed from solu- 
tions containing 100 nM or 1 µM TMX or CLO (Fig. 1d, Extended Data 
Table 2). Thus, even in treatments where bees ate considerably less food 
in 24h, they still preferred to consume solutions containing IMD over 
sucrose alone. Bumblebees also consumed 1.5-10-fold more of the 
neonicotinoid-laced food than honeybees and were, therefore, exposed 
to higher pesticide doses (Extended Data Table 4). 

Insects detect nutrients and toxins in food via gustatory neurons in 
hair-like sensilla on the proboscis (mouthparts)'*®. Toxic, non-nutri- 
tious compounds elicit spikes in ‘bitter’-sensing neurons’””®, but can 
also be detected via suppression of the responses of sugar-sensing 
neurons”'”’. Previous research has established that gustatory neurons 
located in sensilla on the honeybee’s mouthparts are more sensitive to 
toxins in food’’ than its antennae”’ or tarsi’’. If bees have mechanisms 
for detecting neonicotinoids, sensilla on the mouthparts should 
respond to these substances in the same way they respond to other 
toxins’”. To test this, we recorded from gustatory neurons in sensilla on 
the galea (part of the proboscis) of bumblebees and honeybees using 
the tip recording technique (Fig. 2a, b). Stimulation with IMD, TMX or 
CLO in water did not elicit spikes from any of the neurons in the galeal 
sensilla of either bumblebees (Fig. 2c) or honeybees (Fig. 2d), whereas 


1Institute of Neuroscience, Newcastle University, Newcastle upon Tyne NE2 4HH, UK. 2Botany Department, Trinity College Dublin, Dublin 2, Ireland. 3School of Biology, Newcastle University, Newcastle 
upon Tyne NE1 7RU, UK. 4Centre for Neural Circuits and Behaviour, Tinsley Building, University of Oxford, Oxford OX1 3SR, UK. 
Figure 1 | Foraging-age bees prefer to eat food containing neonicotinoids. 
a, b, Bumblebees (a) and honeybees (b) given a choice of sucrose or sucrose 
containing a neonicotinoid pesticide chose to eat solutions containing IMD and 
TMX (Extended Data Table 2, bumblebees: generalized linear model (GLM): 
χ² = 12.1, P = 0.002; honeybees: GLM, χ² = 11.1, P = 0.004). Data represent 
the mean difference in the amount consumed over 24 h; positive values indicate 
a preference for solutions containing neonicotinoids. White bars indicate the 
sucrose control. Asterisks indicate P = 0.002 (Bonferroni-adjusted critical 
value) for one-sample t-tests against the ‘0’ value (indicating no preference, see 
Extended Data Table 3). Sample sizes: bumblebees: IMD: 1 nM = 57, 
10 nM = 66, 100 nM = 65, 1 µM = 66; TMX: 1 nM = 38, 10 nM = 39, 


stimulation with nicotine hydrogen tartrate (NHT), KCl and sucrose 
did (Fig. 2c-f). This effect was the same for all three neonicotinoids in 
both bee species (Extended Data Table 5). To test whether neonicoti- 
noids are detected via suppression of the neurons’ responses to sugars, 
we applied sucrose solution laced with IMD, TMX and CLO in an 
ascending series of concentrations from 1 nM to 1 µM (Fig. 2g, h). 
None of the concentrations we tested altered the spiking activity of 
sucrose-sensitive gustatory neurons in the bumblebees’ or the honey- 
bees’ sensilla (Fig. 2g, h, Extended Data Table 5). (Note: we confirmed 
that the mean spike rates reported in Fig. 2h were not a result of 
simultaneous excitation of bitter neurons and inhibition of sucrose- 
Figure 2 | Electrophysiological recordings of the gustatory receptor 
neurons from the mouthparts of bumblebees and honeybees during 
stimulation with neonicotinoids. a, b, Scanning electron micrographs (SEM) 
of the galea of bumblebees (a) and honeybees (b). Recordings were made from 
the basiconic sensilla of the galea (white arrows); inserts are higher resolution 
SEM of individual sensilla. c, d, Spike trains recorded from both species reveal 
responses to NHT and to sucrose, but not to IMD. e, f, Boxplots of the spiking 
responses of gustatory neurons of the mouthparts of bumblebees (e) and 
honeybees (f) to KCl, NHT and two concentrations of each of the 
neonicotinoids. Dashed lines represent the median response to 50 mM sucrose. 
Solutions of the three neonicotinoids did not elicit activity from gustatory 
neurons greater than the response to water (indicated as ‘0’ on x axis) 
(Extended Data Table 5, ANOVA: bumblebees: F277 = 0.935, P = 0.397; 
honeybees: F144 = 2.38, P = 0.096). (Note: NHT elicited spike frequencies in 
100 nM = 36, 1 µM = 40; CLO: 1 nM = 57, 10 nM = 59, 100 nM = 48, 
1 µM = 62. Honeybees: n = 40 cohorts of 25 bees per treatment. Experiments 
were replicated with individuals taken from over 20 different bumblebee 
colonies and 4 honeybee colonies. c, The total amount of food eaten from both 
tubes by bumblebees was affected by the concentration and the presence of a 
neonicotinoid pesticide (GLM: χ² = 47.7, P < 0.001, Extended Data Table 2) 
in one of the food tubes. d, Honeybees ate less total food only when it contained 
1,000 nM TMX or CLO (GLM: χ² = 10.5, P = 0.005, Extended Data Table 2). 
White diamonds indicate amount eaten by sucrose control group. *P < 0.05 in 
post hoc comparisons against sucrose. Error bars represent ± s.e.m. 


sensing neurons by manually spike sorting the records for IMD, 
Extended Data Fig. 4.) Furthermore, we found that both forager and 
newly emerged honeybees lack taste neurons that respond to these 
compounds (Extended Data Fig. 3b). Therefore, the behavioural data 
and electrophysiological recordings from mouthparts’ gustatory neu- 
rons lead us to conclude that bumblebees and honeybees cannot taste 
neonicotinoids in nectar. 

The preference of the bees in our assays for solutions containing 
IMD or TMX probably arises from the pharmacological action of these 
compounds on nicotinic acetylcholine receptors (nAChRs) in the 
bees’ brains. It does not reflect a generalized enhancement of feeding 
gustatory neurons greater than those elicited by water in only 11/17 of the 
bumblebees we tested, whereas NHT elicited spike frequencies greater than 
water in all of the honeybees tested). Sample sizes: bumblebees: nIMD = 5; 
nTMX = 7; nCLO = 5. Honeybees: nIMD = 5; nTMX = 5; nCLO = 6. g, h, The 
spiking response to sucrose was not reduced by the presence of the 
neonicotinoids at concentrations in the nectar-relevant range (Extended Data 
Table 5, ANOVA: bumblebees: Fg = 0.579, P = 0.449; honeybees: 
Fi 127 = 2.00, P = 0.053). Bumblebees: nIMD = 8; nTMX = 5; nCLO = 6. 
Honeybees: nIMD = 6; nTMX = 5; nCLO = 6. Boxplots represent the median 
(black bars), the 1.5 interquartile range (whiskers) and outliers (circles). Stimuli 
on x axes of e-h are in order of presentation during the experiment. 
Bumblebees in both experiments were randomly selected from 8 colonies; 
honeybees in both experiments were randomly selected from 4 colonies. N, 
NHT; S, sucrose. 
because bees consuming these pesticides ate less food overall. 
Remarkably, the preference occurred even when bees consuming these 
solutions were more likely to die. Our data may indicate, therefore, that 
IMD and TMX affect the neural mechanisms involved in learning 
about the location of rewarding food. Previous studies have demon- 
strated that free-flying honeybees prefer to collect sucrose solutions 
containing low concentrations of nicotine”. Nicotine also activates 
nAChRs” expressed throughout the bee brain, including the mush- 
room bodies required for learning and memory”. It is notable that 
several studies have shown that chronic neonicotinoid administration 
impairs olfactory learning and memory in honeybees'*”*”. Our find- 
ing that bees acquire a preference for food laced with IMD or TMX 
could be explained by shorter neonicotinoid exposure in our experi- 
ments or by differential sensitivity of the nAChRs in the relevant brain 
regions necessary for each task’®. It is also plausible that differential 
sensitivity of nAChRs accounts for the avoidance of solutions containing 
IMD that we observed in newly emerged bees. 

Consumption of neonicotinoid-laced nectar by foraging bees could 
lead to higher attrition in this behavioural caste as well as reducing 
their foraging efficiency for pollen*”°. This would have a greater impact 
on solitary bee species and on wild bee colonies with relatively few 
foragers than on honeybee colonies. If foragers prefer to collect nectar 
containing IMD and TMX, they will also bring more neonicotinoid- 
laced food back to the colony. For these reasons, whole colonies could 
be exposed to higher levels of these pesticides in the field than had been 
predicted previously. Mitigation strategies that rely on planting 
alternative sources of nectar and pollen, therefore, might not be 
enough to decrease the risk of poisoning pollinators with pesticides. 
Instead, long-term changes to policy that include reducing their use 
may be the only certain means of halting pollinator population decline. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 

Behavioural two-choice assays. Experiments were performed at Trinity College, 
Dublin with Bombus terrestris dalmatinus (Unichem Ltd, Co. Dublin, Irish dis- 
tributor for Koppert). Colonies were maintained at 25-30 °C in 24h darkness and 
fed commercial pollen and Biogluc (Agralan Ltd, Swindon) bee food ad libitum. 
Experiments were also performed at Newcastle University, Newcastle upon Tyne 
with Bombus terrestris audax (Biobest, Belgium) and Bombus terrestris terrestris 
(Koppert Biological Systems, NATURPOL, Netherlands). Bees from 3-5 different 
colonies were used for each neonicotinoid. Individual worker bumblebees 
were collected as they tried to exit the colony. For the experiments with newly 
emerged bumblebees, colonies were monitored for newly emerged bees daily; 
newly emerged adults were identified by their pale colour. These bees were 
extracted using forceps from within the colony. As previously described in 
Tiedeken et al. (2014)’*, individual bumblebees were cold anaesthetized, weighed 
and sex-determined, and transferred to individual 650 ml plastic containers 
(160 × 110 × 45 mm). Containers were fitted with three 3 ml feeding tubes, 
inserted horizontally. Feeding tubes had four 2 mm holes so bees could alight 
on the tubes and feed from the openings. The feeding tubes contained one of three 
solutions: (1) deionized water; (2) 0.5 M sucrose; or (3) 0.5 M sucrose with a 
specific concentration of a neonicotinoid compound. Whether or not the bee 
was alive was noted 24h after the start of the experiment. Bees that did not drink 
from either tube were excluded from the final analysis; the total number of these 
subjects was never greater than 3 per treatment (note: these subjects were always 
dead and likely to have died from stress or other causes). 

Experiments with honeybees (Apis mellifera var. Buckfast) were performed at 
Newcastle University during the summer months using 2 free-flying outdoor 
colonies originally obtained from the UK’s National Bee Unit (Sand Hutton, 
Yorkshire). Foraging adult worker honeybees were collected at the colony entrance 
as they returned from foraging; newly emerged adult workers were collected from 
brood comb as they emerged in a purpose-built box kept in an incubator at 34 °C. 
Bees were cold anaesthetized before placing in rearing boxes. Cohorts of 25 bees 
were placed in rearing boxes as previously described in Paoli et al. (2014)°". Five 
food tubes (as described above) were provided: (1) one with deionized water; (2) 
two with 1 M sucrose; (3) two with 1 M sucrose containing a specific concentration 
of a neonicotinoid. The number of bees alive in each cohort was counted at the 
time of measurement of the food consumption (24h later). 

All of the two-choice experiments were performed experimenter-blind (except 
IMD with bumblebees). Three neonicotinoid pesticides, imidacloprid (IMD), 
thiamethoxam (TMX) and clothianidin (CLO), were used in the experiments 
(Pestanal, Sigma-Aldrich). The neonicotinoid concentrations used were 1 nM, 
10 nM, 100 nM, 1 µM (see Extended Data Table 4 for conversions to ppb and 
ng per bee). Bees were kept in continuous darkness for 24h at constant temper- 
ature and 60% RH (bumblebees: 28 °C; honeybees: 34 °C). Control boxes identical 
to the experimental boxes (without bees) for each neonicotinoid treatment were 
placed in the incubator simultaneously with the experiments to measure the rate 
of evaporation from the food solutions. Feeding tubes were weighed, placed 
in the experimental boxes with the bees for 24h, and then removed and weighed 
a second time. The position of the treatment tubes was randomized across 
subjects. The amount of solution consumed was determined as the difference 
in the weight of each tube after 24h; the average value for the evaporation 
control for each treatment was subtracted from this final value for each tube. 
For bumblebees, sample sizes were: IMD: 1 nM = 57, 10 nM = 66, 100 nM = 65, 
1 µM = 66; TMX: 1 nM = 38, 10 nM = 39, 100 nM = 36, 1 µM = 40; CLO: 
1 nM = 57, 10 nM = 59, 100 nM = 48, 1 µM = 62. For honeybees, n = 40 cohorts 
of 25 bees per treatment. Sample size was chosen as n = 40 based on previous 
work'*; sample size varied because some individuals died from unknown causes at 
the start of the experiments. No statistical methods were used to predetermine 
sample size. 
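
The consumption measure described above (mass lost by each tube over 24 h, minus the mean evaporation loss for that treatment) and the resulting preference difference can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' analysis code: the function names, the masses in grams and the use of one shared set of evaporation controls for both tubes are hypothetical choices made for the example.

```python
from statistics import mean


def consumed(mass_before, mass_after, evaporation_controls):
    """Evaporation-corrected mass of solution removed from one tube.

    mass_before, mass_after: tube mass at the start and after 24 h.
    evaporation_controls: mass lost over 24 h by tubes of the same
    treatment kept in boxes without bees.
    """
    return (mass_before - mass_after) - mean(evaporation_controls)


def preference(treatment_consumed, sucrose_consumed):
    """Positive values mean more of the neonicotinoid tube was eaten."""
    return treatment_consumed - sucrose_consumed


# Hypothetical masses in grams, for illustration only.
evap = [0.020, 0.030, 0.025]
treat = consumed(3.500, 3.100, evap)   # sucrose + neonicotinoid tube
sucr = consumed(3.480, 3.200, evap)    # sucrose-only tube
diff = preference(treat, sucr)         # difference between the two tubes
total = treat + sucr                   # total food eaten from both tubes
```

The sign convention for `diff` follows the Figure 1 caption, in which positive values indicate a preference for the neonicotinoid-containing solution.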

Honeybee antennal and mouthparts assays. Honeybees were collected at the 
entrance of an outdoor colony as they returned from foraging, cold-anaesthetized, 
and harnessed as described in Bitterman et al. (1983)**. Each was fed 1 M sucrose 
to satiety and left overnight in a humidified plastic box and assayed ~18 h later. 
Briefly, two assays were employed: one in which individual honeybees were lightly 
tapped on the antenna with a stimulating solution (for example, sucrose) to elicit 
the feeding reflex (that is, proboscis extension reflex, or PER) and a second assay in 
which a droplet of stimulating solution was placed at the end of the extended 
proboscis to test whether bees would consume it (further details described in 
Wright et al. 2010'’). Stimulating solutions were 1 M sucrose containing one of 
the following concentrations (1 nM, 10 nM, 100 nM, 1 µM, 10 µM) of one of three 
neonicotinoids (IMD, TMX, CLO). 

Electrophysiology. Individual bumblebees (B. terrestris audax and B. terrestris 
terrestris) and honeybees were cold-anaesthetized on ice for 3-5 min, and then 
restrained in a metallic restraining harness as described in Bitterman et al. 
(1983)**. To avoid any movements of the mouthparts during recordings, mus- 
cles that trigger proboscis retraction were cut by making an incision at the level 
of the proboscis fossa. Each galea was fixed with a curved metallic wire pinned 
into dental wax. 

Electrophysiological recordings were made from taste neurons located in the 
first 11 sensilla chaetica*’ located at the tip of the galea on the honeybee’s 
proboscis as in Wright et al. (2010)’ and in the first 6 sensilla in bumblebees. 
Bees were electrically grounded via a chlorinated silver wire inserted into the 
head. Sensilla were visualized under a microscope (M205C, Leica, Germany) at a 
magnification of ×256. To record from gustatory neurons, we used a method
first described by Hodgson et al. (1955). Sensilla were stimulated with a recording
borosilicate electrode (50 mm long, 20 µm diameter) containing the test
compounds diluted in demineralized water. The recording electrode was
connected via a chlorinated silver wire to a high impedance ‘non-blocking’
pre-amplifier (TastePROBE, Syntech, Germany)** mounted on a motorized 
micromanipulator (MPC-200, Sutter Instrument, USA). The signal was further 
amplified and filtered with an AC amplifier (model 1800, gain: 100, band- 
pass filter: 10-1,000 Hz, A-M Systems, USA). Each stimulus trial was digitized 
(sampling rate 10 kHz, 16 bits; DT9803 Data Translation), stored on a com- 
puter with dbWave software (version 4.2014.3.22) and analysed with Matlab 
R2012b (version 8.0.0.783) using PeakFinder with fixed thresholds as the peak 
detection algorithm (PeakFinder.m., Mathworks file ID: 25500). Recordings 
were made for 2s, but only data for the first second were included in the 
analysis. The first 100 ms were removed to avoid the contact artefact. For bum- 
blebees, 2-6 sensilla were sampled per bee; for honeybees, 6-10 sensilla were 
sampled per bee. 
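
The analysis window and fixed-threshold detection described above can be sketched as follows. This is a minimal illustration in Python, not the PeakFinder routine used in the study: the function name, the threshold value and the exact window (from 100 ms to 1 s after electrode contact) are assumptions made for the example.

```python
def count_spikes(signal, fs_hz=10_000, threshold=0.5, start_s=0.1, stop_s=1.0):
    """Count upward threshold crossings between start_s and stop_s.

    signal    -- sequence of voltage samples, first sample at electrode contact
    fs_hz     -- sampling rate (10 kHz in the text)
    threshold -- fixed detection threshold (hypothetical value and units)
    """
    start = int(round(start_s * fs_hz))  # skip the contact artefact
    stop = int(round(stop_s * fs_hz))    # keep only the first second
    window = signal[start:stop]
    spikes = 0
    for prev, cur in zip(window, window[1:]):
        if prev < threshold <= cur:      # rising edge through the threshold
            spikes += 1
    return spikes
```

A 2 s recording at 10 kHz holds 20,000 samples, so with these defaults only samples 1,000 to 9,999 contribute to the count.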

Recording started when the open end of the electrode was placed over the tip of 
the sensillum. Individuals were repeatedly sampled in one of two protocols: (1) 
50 mM sucrose, 100 mM KCl, water, 1 µM neonicotinoid, 1 mM neonicotinoid,
1 mM NHT, 100 mM KCl, 50 mM sucrose; or (2) 50 mM sucrose, 50 mM sucrose
+ neonicotinoid in one of the following concentrations (1 nM, 10 nM, 1 µM),
50 mM sucrose. The neonicotinoids IMD, TMX, or CLO were used in each pro- 
tocol. Neonicotinoid (Pestanal, Sigma-Aldrich) solutions were prepared as serial 
dilutions starting with 1 mM concentration. Sucrose and nicotine tartrate were 
purchased from Sigma-Aldrich and KCl from Fisher Scientific at purity = 98%. 
Demineralized water was used to prepare all solutions. Intervals between stimuli 
were 2-5 min. 

Recordings with IMD diluted in sucrose (Extended Data Fig. 4) were further 
analysed using dbWave (http://perso.numericable.fr/frederic.marion-poll/deterrents/tk/dbwave/index.htm).
Predicted spiking neurons or ‘units’ were sorted
from the digitally filtered signals according to their amplitude with the help of 
interactive software procedures. Electrophysiological recordings were then visu- 
ally inspected to search for spike doublets, that is, two spikes separated by an 
interspike interval shorter than the silent period**’’. Spike trains were analysed 
over 1s following the first 100 ms removed to avoid the contact artefact. 

Electron microscopy. Scanning electron microscopy was performed using a
Cambridge Stereoscan 240 on samples that had been fixed with glutaraldehyde, 
washed in phosphate buffer then dehydrated through an ethanol gradient followed 
by critical point drying. Specimens were then mounted on an aluminium stub with 
Acheson’s silver dag before gold coating with a Polaron SEM coating unit. 

Statistics. All analyses were performed using IBM SPSS v 19. The mean total
number of spikes in the electrophysiological recordings was analysed using 
repeated-measures analysis of variance (ANOVA) for each species with neonico- 
tinoid as a main effect, sensillum number and bee as covariates, and stimulus as a 
repeated measure; a Levene’s test was employed to test for equality of variance. 
Post hoc comparisons were pairwise t-tests with a Bonferroni adjustment for 
experiment-wise error rate. A two-way generalized linear model (GLM) was used 
to compare the behaviour of bees fed each of the neonicotinoid treatments for each 
bee species with least squares post hoc comparisons (Note: the sucrose-sucrose 
choice data were not included because of the requirements of GLM for factorial 
design). The difference in the amount eaten between the 2 food tubes in the 
behavioural choice assays was also analysed using a one-sample t-test against zero 
for each treatment; critical values were Bonferroni-adjusted. The proportion of 
bees alive after 24h was analysed using logistic regression (lreg). Each individual 
bee was entered in the analysis for the experiments with bumblebees and with 
honeybees. For the analysis with honeybees, ‘cohort’ was entered as a covariate. No 
statistical methods were used to predetermine sample size. 
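
As an illustration of the one-sample t-test against zero applied to the difference in consumption between the two food tubes, the statistic can be computed as below. This is a sketch in Python rather than the SPSS procedure used in the study; the function name is hypothetical and any input data are placeholders.

```python
import math
import statistics

def one_sample_t(differences, mu0=0.0):
    """Return (t, df) for a one-sample t-test of mean(differences) against mu0."""
    n = len(differences)
    mean = statistics.fmean(differences)
    sd = statistics.stdev(differences)        # sample SD (n - 1 denominator)
    t = (mean - mu0) / (sd / math.sqrt(n))    # mean divided by its standard error
    return t, n - 1
```

The P value then follows from the t distribution with n - 1 degrees of freedom, which requires a statistics package and is not computed here.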
Extended Data Figure 1 | The proportion of bees surviving after 24 h in the
two-choice assay. Data from Fig. 1. a, Bumblebees given a choice between
sucrose and sucrose laced with 1,000 nM TMX or CLO were less likely to
survive after 24 h (lreg: IMD: χ² = 4.36, P = 0.359; TMX: χ² = 62.3,
P < 0.001; CLO: χ² = 79.7, P < 0.001). b, Honeybees given a choice between
sucrose and sucrose laced with 1,000 nM TMX or CLO were less likely to
survive after 24 h (lreg: IMD: χ² = 5.18, P = 0.269; TMX: χ² = 577, P < 0.001;
CLO: χ² = 243, P < 0.001). Cohort (cov) accounted for a significant portion of
the variance in survival for all three treatment groups (lreg: IMD: χ² = 22.0,
P < 0.001; TMX: χ² = 32.4, P < 0.001; CLO: χ² = 70.2, P < 0.001). Sample
sizes are the same as in Fig. 1. *P < 0.05 in least squares post hoc comparisons
against sucrose in each treatment.
[Extended Data Figure 2, plot residue. Panels: a, Antennal stimulation; b, Mouthparts stimulation. x axis: Conc neonicotinoid (nM), 0.1 to 10k; y axis: 0 to -1 (label not recoverable); series: IMD, TMX, CLO.]

Extended Data Figure 2 | Antennal proboscis extension response (PER) and
mouthparts assay of honeybees to solutions containing neonicotinoids.
a, Stimulation of the antennae with 1 M sucrose solutions containing
neonicotinoids did not affect the elicitation of PER. b, Honeybees did not refuse
to consume solutions containing neonicotinoids; only one bee in the CLO
treatments failed to drink the solutions. n = 40 per neonicotinoid treatment for
antennal stimuli and n = 10 for each concentration of each neonicotinoid for
the mouthparts taste assay. Bees were randomly selected from 2 colonies.
[Extended Data Figure 3, panel titles. a, Behavioural 2-choice assay; b, Gustatory neuron responses (A. mellifera).]
Extended Data Figure 3 | Young bees avoid solutions containing
neonicotinoids. a, Newly emerged worker bumblebees (n = 30 bees per
treatment) and honeybees (n = 20 boxes per treatment) were tested in the
behavioural choice assay with 1 nM and 10 nM IMD in sucrose solution as in
Fig. 1. Bumblebees avoided consuming both solutions containing IMD
(one-sample t-test against 0, 1 nM: P < 0.001, 10 nM: P = 0.001), whereas honeybees
avoided only the 1 nM concentration (one-sample t-test against 0, 1 nM:
P = 0.003, 10 nM: P = 0.773). Error bars represent ± s.e.m. b, The presence of
IMD did not alter the spike frequency of gustatory neurons in the galeal sensilla
of newly emerged honeybees (repeated-measures ANOVA, stimulus:
F1,47 = 0.207, P = 0.653). Recordings were made from the basiconic sensilla on
the galea as in Fig. 2. Boxplots represent the frequencies of responses to 50 mM
sucrose or to 50 mM sucrose solutions containing 1 nM or 10 nM IMD. n = 5
bees, 10 sensilla per bee. Boxplots represent the median (black bars), the 1.5 ×
interquartile range (whiskers) and outliers (circles). Stimuli on x axis are in
order of presentation during the experiment.
Extended Data Figure 4 | Spike-sorted recordings. Data from four of the
honeybees in Fig. 2h. a, To verify that the spike rates we observed in Fig. 2h were
not a result of changes in the rates of firing of individual neurons, we
spike-sorted recordings from four honeybees stimulated with sucrose and IMD.
b, Spike sorting revealed two potential spiking neurons (units) characterized by
different spike amplitudes; both units spiked in response to sucrose stimulation.
(This was also observed previously by Wright et al. 2010.) One neuron is
labelled in green, the other in red. Spike doublets (indicated in pink as ‘d’) where
both neurons spiked nearly simultaneously were also observed. c, d, These same
two spiking neurons continued to respond when stimulated with sucrose
containing 1 µM IMD. e, Boxplots reveal that the rate of spiking was lower on
average for one of the neurons (repeated-measures ANOVA, unit: F1,36 = 596,
P < 0.001). The rate of firing of both neurons was not affected by IMD
concentration (repeated-measures ANOVA, unit: F1,36 = 0.369, P = 0.547).
Spikes from additional neurons (units) were not detected, and so we concluded
that no other neurons were recruited during stimulation with IMD. ‘S’ indicates
stimulation with sucrose. Boxplots represent the median (black bars), the 1.5 ×
interquartile range (whiskers) and outliers (circles). Stimuli on x axis are in
order of presentation during the experiment.
Extended Data Table 1 | Concentrations of neonicotinoids reported in floral nectar

                        | Imidacloprid              | Thiamethoxam                  | Clothianidin
Source                  | ng/g   | PPB    | nM      | ng/g     | PPB      | nM      | ng/g   | PPB    | nM
Schmuck et al. 2001     | 1.9    | 1.9    | 7.43    | -        | -        | -       | -      | -      | -
Pohorecka et al. 2012   | 0.6    | 0.6    | 2.34    | 4.2      | 4.2      | 14      | 2.3    | 2.3    | 9.2
Dively and Kamel 2012   | 0.4-11 | 0.4-11 | 1.5-43  | 8.2-9.5  | 8.2-9.5  | 28-37   | -      | -      | -
Stoner and Eitzer 2012  | 10     | 10     | 39      | 1        | 1        | 37      | -      | -      | -
Byrne et al. 2013       | 2.9-39 | 2.9-39 | 11-154  | -        | -        | -       | -      | -      | -
Larson et al. 2013      | -      | -      | -       | -        | -        | -       | 171    | 171    | 684
Pilling et al. 2013     | -      | -      | -       | 0.65-2.4 | 0.65-2.4 | 2.2-8.2 | -      | -      | -
Defra 2013              | 0.13   | -      | 0.5     | 1-3.9    | 1-3.9    | 3.4-13  | 0.18-4 | 0.18-4 | 0.7-16


in this table. 
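
The nM columns above are consistent with dividing the mass concentration by the molar mass of each compound. A minimal sketch of that conversion follows, in Python; the function name is hypothetical, the molar masses are standard values that are not stated in the table, and a solution density of 1 g/ml is an assumption.

```python
# Molar masses in g/mol (standard values; not stated in the table itself).
MOLAR_MASS = {"IMD": 255.66, "TMX": 291.71, "CLO": 249.68}

def ppb_to_nM(ng_per_g, compound, density_g_per_ml=1.0):
    """Convert a concentration in ng/g (PPB) to nmol per litre."""
    ng_per_litre = ng_per_g * density_g_per_ml * 1000.0  # 1 litre = 1,000 ml
    return ng_per_litre / MOLAR_MASS[compound]           # ng / (g/mol) = nmol
```

For example, `ppb_to_nM(1.9, "IMD")` gives about 7.43, matching the first row of the table.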
Extended Data Table 2 | Generalized linear models for the neonicotinoid choice experiment and total food consumption

B. terrestris              | Choice test           | Total food consumption
Between-subjects contrasts | df | χ²   | P-value | df | χ²   | P-value
Concentration              | 3  | 27.9 | <0.001  | 3  | 263  | <0.001
Neonicotinoid              | 2  | 12.1 | 0.002   | 2  | 150  | <0.001
Neonic × Conc              | 6  | 7.97 | 0.240   | 6  | 47.7 | <0.001

A. mellifera               | Choice test           | Total food consumption
Between-subjects contrasts | df | χ²   | P-value | df | χ²   | P-value
Concentration              | 3  | 4.93 | 0.176   | 3  | 37.1 | <0.001
Neonicotinoid              | 2  | 444  | 0.004   | 2  | 10.5 | 0.005
Neonic × Conc              | 6  | 5.89 | 0.435   | 6  | 11.4 | 0.076

Data from Fig. 1. Values in bold indicate interpreted model parameters. Note: sucrose-sucrose (control) data were not included.
Extended Data Table 3 | One-sample t-tests against ‘0’ for each treatment of the 24 h behavioural assay

B. terrestris
Sucrose (control): N = 55, t(df) = -0.24(54), P-value = 0.402

        | IMD                       | TMX                       | CLO
        | N  | t(df)     | P-value  | N  | t(df)     | P-value  | N  | t(df)     | P-value
1 nM    | 57 | 5.13(56)  | <0.001*  | 38 | 3.11(38)  | 0.002*   | 57 | 0.22(56)  | 0.246
10 nM   | 66 | 2.39(65)  | 0.010    | 39 | 3.11(37)  | 0.002*   | 59 | 0.26(58)  | 0.183
100 nM  | 65 | 2.33(64)  | 0.012    | 36 | 1.31(35)  | 0.099    | 48 | 0.09(47)  | 0.465
1 µM    | 66 | -2.6(65)  | 0.005    | 40 | -1.15(39) | 0.128    | 62 | -2.36(61) | 0.021

A. mellifera
Sucrose (control): N = 40, t(df) = -0.85(39), P-value = 0.199

        | IMD                       | TMX                       | CLO
        | N  | t(df)     | P-value  | N  | t(df)     | P-value  | N  | t(df)     | P-value
1 nM    | 40 | 1.93(39)  | 0.031    | 40 | -0.32(39) | 0.376    | 40 | -0.288    | 0.387
10 nM   | 40 | 1.75(39)  | 0.044    | 40 | 3.80(39)  | <0.001*  | 40 | 0.882     | 0.191
100 nM  | 40 | 2.97(39)  | 0.002*   | 40 | 3.23(39)  | 0.001*   | 40 | -0.221    | 0.414
1 µM    | 40 | 2.00(39)  | 0.026    | 40 | 3.25(39)  | 0.001*   | 40 | 0.423     | 0.337

Data from Fig. 1. P values are for 1-tailed tests. P values in bold are below P = 0.05. *Application of a Bonferroni adjustment criterion alters the P value threshold from P = 0.05 to P = 0.002.
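
The Bonferroni criterion divides the nominal threshold by the number of tests. The sketch below, in Python, shows the arithmetic; the function names are hypothetical, and the count of 26 tests (13 per species in this table) is an assumption about how the adjusted threshold of 0.002 was obtained.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni adjustment."""
    return alpha / n_tests

def passes_bonferroni(p_value, alpha=0.05, n_tests=26):
    """True if p_value is below the adjusted threshold (n_tests is assumed)."""
    return p_value < bonferroni_threshold(alpha, n_tests)
```

With alpha = 0.05 and 26 tests the threshold is about 0.0019, which rounds to the 0.002 quoted in the table footnote.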
Extended Data Table 4 | Comparison of doses consumed by each bee species for each treatment

B. terrestris
    | 1 nM                           | 10 nM                          | 100 nM                         | 1 µM
    | ml/bee | PPB   | ng/bee/24 h   | ml/bee | PPB  | ng/bee/24 h    | ml/bee | PPB  | ng/bee/24 h    | ml/bee | PPB | ng/bee/24 h
IMD | 0.257  | 0.256 | 0.064 (0.043) | 0.167  | 2.56 | 0.418 (0.337)  | 0.159  | 25.6 | 3.98 (3.22)    | 0.055  | 256 | 13.9 (18.4)
TMX | 0.360  | 0.292 | 0.105 (0.077) | 0.357  | 2.92 | 1.05 (0.862)   | 0.354  | 29.2 | 10.3 (8.74)    | 0.115  | 292 | 33.6 (33.9)
CLO | 0.279  | 0.250 | 0.070 (0.065) | 0.259  | 2.50 | 0.647 (0.600)  | 0.211  | 25.0 | 5.28 (4.93)    | 0.041  | 250 | 10.3 (13.6)

A. mellifera
    | 1 nM                           | 10 nM                          | 100 nM                         | 1 µM
    | ml/bee | PPB   | ng/bee/24 h   | ml/bee | PPB  | ng/bee/24 h    | ml/bee | PPB  | ng/bee/24 h    | ml/bee | PPB | ng/bee/24 h
IMD | 0.046  | 0.256 | 0.012 (0.010) | 0.046  | 2.56 | 0.118 (0.103)  | 0.045  | 25.6 | 1.16 (0.974)   | 0.045  | 256 | 11.7 (9.95)
TMX | 0.040  | 0.292 | 0.012 (0.011) | 0.048  | 2.92 | 0.141 (0.117)  | 0.036  | 29.2 | 1.07 (1.02)    | 0.035  | 292 | 10.3 (8.63)
CLO | 0.043  | 0.250 | 0.011 (0.010) | 0.044  | 2.50 | 0.112 (0.101)  | 0.043  | 25.0 | 1.08 (0.868)   | 0.034  | 250 | 8.51 (7.86)
Data from Fig. 1. Note: ng/bee values were calculated based on the mean values consumed from the neonicotinoid-containing food tubes for each treatment (ml/bee). This calculation is the product of the ng/t 
neonicotinoid in the food solution and the amount of solution eaten (1!) per bee in 24 h. The values in parentheses in the ng/bee/24 h column are the expected values if bees had eaten from both tubes equally. T| 
value was calculated by dividing the total amount eaten for each treatment in Fig. 1c and d by 2 and using this quantity to estimate the dose. 
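
The dose calculation described in the note can be sketched as below. This is an illustration in Python with hypothetical function names; it assumes that the PPB column can be read as ng per ml (that is, a solution density of about 1 g/ml).

```python
def dose_ng_per_bee(conc_ng_per_ml, volume_ml):
    """Dose per bee: concentration in the food times the volume eaten."""
    return conc_ng_per_ml * volume_ml

def expected_dose_if_equal(conc_ng_per_ml, total_volume_ml):
    """Dose expected if half of all food eaten came from the treated tube."""
    return conc_ng_per_ml * (total_volume_ml / 2.0)
```

For bumblebees given 1 nM TMX, `dose_ng_per_bee(0.292, 0.360)` gives about 0.105 ng per bee in 24 h, in line with the tabulated value.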
Extended Data Table 5 | Repeated-measures ANOVA

B. terrestris               | Water                 | Sucrose solution
Within subjects contrasts   | df  | F     | P-value | df  | F     | P-value
Stimulus                    | 1   | 8.60  | 0.004   | 1   | 0.579 | 0.449
Stimulus × bee (cov)        | 1   | 4.45  | 0.038   | 1   | 1.23  | 0.271
Stimulus × sensillum (cov)  | 1   | 0.038 | 0.846   | 4   | 0.558 | 0.458
Stimulus × neonicotinoid    | 2   | 0.935 | 0.397   | 2   | 0.287 | 0.752
Error(stim)                 | 77  |       |         | 86  |       |
Between subjects contrasts  | df  | F     | P-value | df  | F     | P-value
Neonicotinoid               | 2   | 10.2  | 0.937   | 2   | 0.004 | 0.996
Bee (cov)                   | 1   | 0.164 | 0.686   | 1   | 0.871 | 0.354
Sensillum (cov)             | 1   | 5.63  | 0.020   | 1   | 3.35  | 0.071
Error                       | 77  |       |         | 86  |       |

A. mellifera                | Water                 | Sucrose solution
Within subjects contrasts   | df  | F     | P-value | df  | F     | P-value
Stimulus                    | 1   | 95.6  | <0.001  | 1   | 747   | 0.007
Stimulus × bee (cov)        | 1   | 4.20  | 0.042   | 1   | 5.31  | 0.023
Stimulus × sensillum (cov)  | 1   | 0.303 | 0.583   | 1   | 0.142 | 0.707
Stimulus × neonicotinoid    | 2   | 2.38  | 0.096   | 2   | 3.00  | 0.053
Error(stim)                 | 144 |       |         | 127 |       |
Between subjects contrasts  | df  | F     | P-value | df  | F     | P-value
Neonicotinoid               | 2   | 1.23  | 0.295   | 2   | 6.70  | 0.002
Bee (cov)                   | 1   | 0.335 | 0.563   | 1   | 1.67  | 0.198
Sensillum (cov)             | 1   | 1.37  | 0.244   | 1   | 12.6  | 0.001
Error                       | 144 |       |         | 127 |       |

Data from Fig. 2. Note: for the ‘Water’ model, the stimulus variable included: sucrose, KCl, nicotine, water, 1 µM, and 1 mM neonicotinoid. For the ‘sucrose solution’ model, the stimulus variable included: sucrose, 1 nM,
100 nM, and 1 µM neonicotinoid. The significant ‘stimulus × neonicotinoid’ term in the sucrose solution experiment for honeybees reflects a slight adaptive effect that occurred in the experiments with IMD, but not
with TMX or CLO. Pairwise comparisons of each stimulus applied in the IMD experiment revealed that the 1 µM IMD and the final sucrose control stimulus produced fewer spikes than the first sucrose stimulus
(P = 0.024 and P = 0.002). However, the 1 µM IMD and the final sucrose stimulus were not significantly different (P = 0.546), indicating either that the neurons in these experiments exhibited a slight adaptation
effect or that the 1 µM IMD concentration had a toxic effect that influenced the integrity of their responses to sucrose.
Seed coating with a neonicotinoid insecticide
negatively affects wild bees


Maj Rundlöf, Georg K. S. Andersson, Riccardo Bommarco, Ingemar Fries, Veronica Hederström, Lina Herbertsson,
Ove Jonsson, Björn K. Klatt, Thorsten R. Pedersen, Johanna Yourstone & Henrik G. Smith


Understanding the effects of neonicotinoid insecticides on bees is 
vital because of reported declines in bee diversity and distri- 
bution’” and the crucial role bees have as pollinators in ecosystems 
and agriculture*. Neonicotinoids are suspected to pose an unac- 
ceptable risk to bees, partly because of their systemic uptake in 
plants*, and the European Union has therefore introduced a mora- 
torium on three neonicotinoids as seed coatings in flowering crops 
that attract bees®. The moratorium has been criticized for being 
based on weak evidence’, particularly because effects have mostly 
been measured on bees that have been artificially fed neonicoti- 
noids*’. Thus, the key question is how neonicotinoids influence 
bees, and wild bees in particular, in real-world agricultural land- 
scapes''"*. Here we show that a commonly used insecticide seed 
coating in a flowering crop can have serious consequences for wild 
bees. In a study with replicated and matched landscapes, we found 
that seed coating with Elado, an insecticide containing a combination
of the neonicotinoid clothianidin and the non-systemic pyrethroid
β-cyfluthrin, applied to oilseed rape seeds, reduced wild bee
density, solitary bee nesting, and bumblebee colony growth and 
reproduction under field conditions. Hence, such insecticidal use 
can pose a substantial risk to wild bees in agricultural landscapes, 
and the contribution of pesticides to the global decline of wild 
bees'* may have been underestimated. The lack of a significant 
response in honeybee colonies suggests that reported pesticide 
effects on honeybees cannot always be extrapolated to wild bees. 

Neuroactive neonicotinoids are commonly used in seed coatings to 
control herbivorous insect pests in a variety of crops such as corn, 
cereals and oilseed rape and are taken up systemically by the growing 
plant and distributed to all tissues’. These chemicals account for more 
than one fifth of the world’s insecticide market", and this widespread 
use requires that their effects on non-target organisms are investigated. 
A particular concern is the effect of neonicotinoids on bees®*!”, because 
of the bee’s role in pollinating crops* and declines in bee diversity and 
distribution’. 

These concerns, together with research indicating negative effects of 
neonicotinoids on bees, have led to a European Union-wide restriction 
from 1 December 2013 on the use of the three neonicotinoids, clothia- 
nidin, imidacloprid and thiamethoxam, as seed coating in crops 
attractive to bees®, to allow for studies on their environmental effects. 
Previous studies have mainly focused on the effects of neonicotinoids 
on bees artificially exposed to neonicotinoids*"’, mostly on honey- 
bees''. The key question is how wild bees, which may differ from 
honeybees in response to insecticides’*""’, are affected by neonicoti- 
noids when foraging in real agricultural landscapes’. 

Here we investigated how seed coating oilseed rape with Elado 
(Bayer), including the systemic neonicotinoid clothianidin® and the 
non-systemic pyrethroid β-cyfluthrin’® as active ingredients, influenced
wild and managed bee species in Swedish agricultural landscapes.
Because we assessed effects on bees under field conditions,
our findings have important implications for policies regulating the
use of neonicotinoids as well as for risk assessments of pesticides.
We designed a study with eight pairs of landscapes surrounding 16 
geographically separated (>4 km) spring-sown oilseed rape fields 
(Fig. 1 and Extended Data Table 1). One field in each pair was ran- 
domly assigned to be sown with seeds coated with the dose of Elado 
recommended by the manufacturer and a fungicide, while the other 
field in each pair, the control field, was sown with seeds coated only 
with the fungicide. At these 16 fields we estimated: (1) the density of
wild bees; (2) the nesting activity of the solitary bee Osmia bicornis L.;
(3) the colony development of the bumblebee Bombus terrestris L.; and
(4) the colony strength of the European honeybee Apis mellifera L.

Figure 1 | Paired design with replicated landscapes. Location of the study
area in southern Sweden and the eight pairs of landscapes (P01-P08) centred
on oilseed rape fields sown with insecticide-coated (open circles) or untreated
(control fields, filled circles) seeds. Pairing was based on land use within a 2-km
radius surrounding the oilseed rape fields and geographical proximity between
fields.

¹Lund University, Department of Biology, 223 62 Lund, Sweden. ²Lund University, Centre for Environmental and Climate Research, 223 62 Lund, Sweden. ³Swedish University of Agricultural Sciences,
Department of Ecology, 750 07 Uppsala, Sweden. ⁴Swedish University of Agricultural Sciences, Department of Aquatic Sciences and Assessment, 750 07 Uppsala, Sweden. ⁵Swedish University of
Agricultural Sciences, Centre for Chemical Pesticides, 750 07 Uppsala, Sweden. ⁶Swedish Board of Agriculture, 551 82 Jönköping, Sweden.

Our first finding was that the insecticide seed coating reduced the 
density of wild bees, that is, bumblebees and solitary bees, in the 
flowering oilseed rape fields and adjacent uncultivated field borders 
(generalized linear mixed model (GLMM), F1,7 = 9.68, P = 0.019;
Fig. 2a and Extended Data Table 4). Wild bee density also increased 
with the size of the focal oilseed rape field, most probably because 
larger fields attract more bees or support larger colonies’, but was 
not significantly related to the proportion of agricultural land in the 
surrounding landscape (Extended Data Table 4). Flower cover (num- 
ber and size of flowers) of the oilseed rape had a positive influence on 
wild bee density (F),.4 = 18.57, P < 0.001) and was higher in fields 
sown with insecticide-coated seeds (Extended Data Table 5). However, 
the negative impact of the seed coating on wild bee density persisted 
irrespective of whether (F1,7 = 9.68, P = 0.019; Extended Data Table 4)
or not (F1,6 = 6.36, P = 0.044) flower cover was included as a covariate 
in the statistical model. 

Our second finding was that the insecticide seed coating correlated 
with reduced nesting of the solitary bee O. bicornis. To investigate this 
we placed three trap nests containing 27 O. bicornis cocoons (Extended 
Data Fig. 1) adjacent to each of the 16 fields before the beginning of 
oilseed rape flowering and monitored if emerging females started to 
build brood cells. In six of the eight control fields, but in none of the 
fields treated with the insecticide seed coating, females started to build
brood cells (Wilcoxon test Z = 2.84, P = 0.0045; Fig. 2b). Although the
reasons why the bees failed to build brood cells when exposed to the
insecticide treatment remain unclear, a reduced capacity to navigate®’?°”?
is a possible explanation.

Figure 2 | Bee density and reproduction. a-d, Mean (± 95% confidence
limits) number of wild bees (solitary bees and bumblebees) per 467 m² oilseed
rape field and adjacent border (generalized linear mixed model (GLMM))
(a), median number of tubes per field with O. bicornis brood cells (Wilcoxon
test) (b), mean (± 95% confidence limits) number of B. terrestris queen (filled
circles, GLMM) and worker/male (open circles, linear mixed model (LMM))
cocoons per colony (c), and mean (± 95% confidence limits) number of adult
A. mellifera per colony (colony strength) after placement at the fields (LMM)
(d) in relation to treatment (control or insecticide seed coating) in the oilseed
rape fields. n = 8 fields per treatment. Means and confidence limits are based on
back-transformed, model-estimated least square means. In panel b, horizontal
line in the box, open diamond symbols, boxes and whiskers indicate median,
mean, 25th-75th percentiles and minimum-maximum, respectively. NS, not
significant (P > 0.05); *P < 0.05, **P < 0.01.

Our third finding was that the insecticide seed coating was nega- 
tively related to colony growth and reproduction of the bumblebee B. 
terrestris. Bumblebees are social and form colonies of one queen and 
tens or hundreds of workers. At each of the 16 oilseed rape fields we 
placed six commercially reared B. terrestris colonies (Extended Data 
Fig. 1). During their development, the bumblebee colonies are 
expected to grow in weight and worker force, and thereafter to switch 
to producing new queens and males with a resulting decline in colony 
weight". The seed-coating treatment influenced the weight change of 
B. terrestris colonies (linear mixed model (LMM), day × day × treatment
F1,19 = 130.62, P < 0.001, day × treatment F1,21 = 143.00, P < 0.001;
Extended Data Table 6 and Fig. 3). As expected, B. terrestris
colonies at control fields had an initial growth and a following decline
(day × day F1,23 = 114.70, P < 0.001, day F1,31 = 129.10, P < 0.001),
while those at fields with insecticide seed coating had a considerably
smaller weight change (F1,14 = 10.78, P = 0.0055, F1,16 = 0.92, P = 0.35)
(Extended Data Table 6 and Fig. 3). While the initial colony
weight was the same in the two treatments (Extended Data Table 5),
the rate of weight gain of colonies at fields with insecticide-coated seeds
was lower than that of colonies at control fields (F1,7 = 115.80, P < 0.001;
Extended Data Table 5). Effects of the treatment on colony
development may result both from reduced pollen foraging efficiency 
and insufficient care for the brood*”’**. Bumblebees have an annual 
life cycle where only the new queens produced at the end of the season 
hibernate and form new colonies the following spring. At the end of 
our experiment, fewer queen (GLMM, F1,7 = 7.78, P = 0.027) and
worker/male cocoons (LMM, F1,7 = 8.09, P = 0.025) were produced at
treated fields compared to control fields (Fig. 2c and Extended Data 
Table 5). These findings are in line with the reduced colony growth and 
85% reduction in queen production reported for bumblebee colonies 
artificially exposed to imidacloprid under otherwise realistic condi- 
tions®”®. 

Our fourth finding was that the insecticide seed treatment had no 
significant influence on honeybee colony strength. In contrast to the 
B. terrestris colonies, the A. mellifera colonies did not differ in strength 
(number of adult bees) between the treatments after placement at the 
oilseed rape fields (LMM, F1,7 = 0.01, P = 0.94; Fig. 2d). This finding is
in line with another field study” and previous work suggesting that
honeybees are better at detoxifying after neonicotinoid exposure compared
to bumblebees’’. However, the lack of short-term effects does
not preclude the existence of long-term effects of neonicotinoids”’.

Figure 3 | Bumblebee colony development. Mean (± 95% confidence limits)
bumblebee colony weight change (g) per field and survey day since day of
placement at the fields (dashed horizontal reference line indicates initial colony
weight) in relation to treatment (control (filled circles) or insecticide seed
coating (open circles)). n = 8 fields per treatment. Dots are means of the six
colonies at each field and weighing occasion. Two colonies at different fields
(one control field and one treated field) were not weighed at one occasion,
resulting in five colonies at those fields and weighing occasions. See Extended
Data Table 6 for results from the colony growth model (linear mixed model).

Table 1 | Clothianidin concentrations in bee-collected pollen (ng g⁻¹) and nectar (ng ml⁻¹), and field border plants (ng g⁻¹), and tests of
differences between treatments (control or insecticide-coated seeds)

                                            | Control                               | Insecticide seed coating | Wilcoxon test for difference between treatments (n = 8*)
                                            | Range                 | Mean ± s.e.m. | Range  | Mean ± s.e.m.   | Z     | P
Honeybee pollen                             | 0                     | 0             | 6.6-23 | 13.9 ± 1.8      | -3.16 | 0.0016
Honeybee nectar                             | 0-0.61                | 0.1 ± 0.1     | 6.7-16 | 10.3 ± 1.3      | -3.40 | <0.001
Bumblebee nectar                            | 0                     | 0             | 1.4-14 | 5.4 ± 1.4       | -3.53 | <0.001
Field border plants (≤2 days after sowing)  | 0                     | 0             | 0-5.9  | 1.2 ± 0.8       | -2.90 | 0.0037
Field border plants (2 weeks after sowing)  | No material collected |               | 0-6.5  | 1.0 ± 0.8       |       |

*n = 6 for pollen collected by honeybees at control fields, because no such bees with pollen could be found at two fields; and n = 7 for field border plants collected within 2 days of sowing in both treatments, because
of lack of communication regarding the sowing date between the farmer and the investigator collecting the samples.

Mass-flowering crops are valuable food resources for wild bees’”*, 
but may act as ecological traps if foraging bees are exposed to pesticides 
such as neonicotinoids. To estimate exposure we assessed the transfer 
of clothianidin from plant to bee by first estimating the proportion of 
oilseed rape pollen collected by all three bee species, O. bicornis, B. 
terrestris and A. mellifera (Extended Data Table 6) and then quantify- 
ing the concentrations of clothianidin in bee-collected pollen and 
nectar (Table 1). 

For O. bicornis, we found oilseed rape pollen in nine of 17 examined
cells, accounting for 35.1 ± 17.0% (mean ± s.e.m.) of the collected
pollen (Extended Data Table 5). Because there was no nesting activity
at fields with insecticide-treated seeds, we could not assess pollen
collection there. For B. terrestris, we found that in the 47 pollen samples
collected from bees foraging in the oilseed rape fields, 80.1 ± 5.0%
of the pollen was from oilseed rape, with similar results for both treated
and control fields (Extended Data Table 5). For A. mellifera, the pollen
extracted from pollen traps mounted on the hives contained on average
57.8 ± 5.0% oilseed-rape-type pollen, with similar proportions for
both treated and control fields (Extended Data Table 5).


assessment standards, where predominantly short-term and lethal 
effects are assessed on model species under laboratory conditions”, 
can be used to predict real-world consequences of pesticide use for 
populations, communities and ecosystems”””°. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
Study design. The design initially included 20 fields matched into pairs based on
land use within 2 km (Extended Data Table 1), to cover the foraging distance of 
most bees*!*’, and geographical proximity. One field in each pair was randomly 
assigned to be sown with insecticide-coated seeds and the other field was the 
control field. The matching into field pairs was based on available land-use data 
for 2011, and the landscapes surrounding the selected oilseed rape fields were 
inspected for presence of flowering crops (including other spring-sown oilseed 
rape fields) during 27-28 May 2013. At the same time, establishment and growth 
stages (using the BBCH scale (Biologische Bundesanstalt, Bundessortenamt und 
Chemische Industrie)**) of the oilseed rape plants in the focal fields were inspected. 
After the field inspections, three fields were excluded from the study, because there 
were four (total 20.9 ha), five (22.6 ha) or five (46.2 ha) additional spring-sown 
oilseed rape fields within 2 km from our focal field that, since our study was 
conducted before the moratorium®, may have been additional sources of neoni- 
cotinoid exposure. One field was excluded because a red clover seed field, known to 
be very attractive to foraging bumblebees and influence their density in the sur- 
rounding landscape”’, occurred adjacent to the focal field. In two cases, we decided 
to accept a single other oilseed rape field located at distances of 0.9 km (6.5 ha) and 
1.0 km (4.4 ha) from the planned location of our bee colonies, to retain as many 
replications of fields as possible. At this point, the study design included six 
original field pairs and four fields which had lost their pair field. After reviewing 
land use in the surrounding landscapes and in geographical proximity of the four 
unpaired fields, we decided to match these into two new pairs (P07 and P08 in 
Fig. 1). The final study system included 16 spatially separated (>4 km) spring-sown
oilseed rape fields (mean ± s.e.m. field size 8.9 ± 1.4 ha, range 4-27 ha, with
all fields but one control field in the range 4-11 ha) located in southern Sweden 
(Fig. 1). The landscapes surrounding the fields were distributed along a gradient in 
the proportion of agricultural land, ranging from 6-88%, and the land uses con- 
sidered often co-varied (Extended Data Table 1). 
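
The random assignment of treatment within matched pairs can be sketched as follows. This is a minimal Python illustration, not the authors' procedure: the field labels, the function name `assign_treatments` and the seed are hypothetical.

```python
import random

def assign_treatments(pairs, seed=None):
    """Randomly pick one field per matched pair for insecticide-coated seed.

    `pairs` maps a pair identifier to a tuple of two field labels
    (hypothetical labels, used for illustration only).
    """
    rng = random.Random(seed)  # seeded generator so the draw is reproducible
    assignment = {}
    for pair_id, (field_a, field_b) in pairs.items():
        treated = rng.choice((field_a, field_b))
        control = field_b if treated == field_a else field_a
        assignment[pair_id] = {"treated": treated, "control": control}
    return assignment

# Hypothetical example with two pairs
pairs = {"P01": ("P01-a", "P01-b"), "P02": ("P02-a", "P02-b")}
design = assign_treatments(pairs, seed=1)
```

Each pair contributes exactly one treated and one control field, so treatment is balanced across the landscape contexts used for matching.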

The field in each pair that was randomly assigned to be sown with insecticide-coated
seeds received seeds treated with 25 ml Elado (Bayer; 400 g l⁻¹ clothianidin
+ 80 g l⁻¹ β-cyfluthrin) per kg of seed and the fungicide thiram, and the other field
in the pair was sown with seeds coated with only thiram (the control). Elado
was used instead of clothianidin alone, because the pesticide combination was an
agronomically realistic scenario for clothianidin use in Sweden and in other parts
of Europe. The clothianidin is taken up by the plant, distributed to all parts and
protects the whole plant from pest attack, while the non-systemic β-cyfluthrin is
intended to protect seeds and roots and only a very small amount is found in the
aboveground parts of the plant (<0.5% of applied). We did not detect any
β-cyfluthrin in pollen collected by honeybees at fields with insecticide seed coating
(Extended Data Table 8). Fungicides alongside neonicotinoids have frequently
been used in coating oilseed rape seeds (A. Gunnarson, personal communication).
Since our study was conducted before the moratorium, no approval
for the use of clothianidin-dressed seeds had to be obtained.

All experimental fields were sown with the hybrid oilseed rape cultivar Majong. 
The amount of seeds sown was 150 plants per square metre, which is the recommended
seeding rate for a spring-sown hybrid, and corresponds to 7.5 kg ha⁻¹
for thiram-treated seeds and 7.7 kg ha⁻¹ for Elado + thiram-treated seeds. Sowing
time was chosen and carried out by each farmer during the period 6 April to 18 
May 2013 (Extended Data Table 2). In two of the pairs, the treated fields were sown 
considerably earlier (both 21 April) than the control fields (6 and 7-8 May), 
resulting in a phenological asynchrony between the fields in these pairs. 
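
The stated Elado dose implies a fixed amount of each active ingredient per kilogram of seed. The Python sketch below works through that arithmetic. The per-hectare figures are an approximation only: they assume about 7.5 kg of seed per hectare (the rate given for thiram-treated seed), which the source does not state as the basis for dose calculations.

```python
ELADO_ML_PER_KG_SEED = 25.0        # from the text
CLOTHIANIDIN_G_PER_L = 400.0       # from the text
BETA_CYFLUTHRIN_G_PER_L = 80.0     # from the text
SEED_KG_PER_HA = 7.5               # assumption: seed mass sown per hectare

def grams_per_kg_seed(conc_g_per_l, dose_ml_per_kg=ELADO_ML_PER_KG_SEED):
    """Active ingredient (g) applied per kg of seed."""
    return conc_g_per_l * dose_ml_per_kg / 1000.0

clothianidin_per_kg = grams_per_kg_seed(CLOTHIANIDIN_G_PER_L)
beta_cyfluthrin_per_kg = grams_per_kg_seed(BETA_CYFLUTHRIN_G_PER_L)

# Approximate field-level amounts under the seeding-rate assumption above
clothianidin_per_ha = clothianidin_per_kg * SEED_KG_PER_HA
beta_cyfluthrin_per_ha = beta_cyfluthrin_per_kg * SEED_KG_PER_HA
```

Under these assumptions the calculation gives 10 g clothianidin and 2 g β-cyfluthrin per kg of seed.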

Farmers were not allowed to use other neonicotinoids in the fields, but they
could use the non-neonicotinoid compounds Avaunt (active ingredient: indoxacarb),
Mavrik (active ingredient: τ-fluvalinate), Plenum (active ingredient: pymetrozine)
and Steward (active ingredient: indoxacarb) to control pollen beetles
(Extended Data Table 3). Nevertheless, in one case, at a control field, the farmer
applied Biscaya (Extended Data Table 3), in which the active ingredient is the
neonicotinoid thiacloprid. Thiacloprid has lower acute toxicity for bees than
clothianidin, imidacloprid or thiamethoxam, and excluding this field did not
qualitatively influence the effect of the insecticide seed treatment on the bees
(Extended Data Tables 4-6).

Wild bee monitoring. Wild bees and flower cover were surveyed on three occa- 
sions in the flowering oilseed rape fields and adjacent uncultivated field borders, 
between 17 June and 16 July 2013 (Extended Data Table 2). Four in-field transects 
of 2 × 25 m located 2-4 m from the edge of the oilseed rape field were surveyed
twice (18 June to 12 July and 27 June to 16 July). Transects of 2 × 300 m located at
the outer 1-m edge of the oilseed rape field and 1 m of the adjacent, uncultivated 
field border were surveyed once (17 June to 8 July). Border transects within a field 
pair were surveyed on the same or subsequent days for the six phenologically 
synchronous field pairs and at peak flowering at the fields in the two asynchronous 
field pairs. For in-field transects, at least one of the survey occasions was performed 
within subsequent days for fields within a pair for all pairs but one (P04), and the
other survey at peak flowering within the individual fields (Extended Data Table
2). Surveys were only conducted on warm days with no rain and light winds (<7 m
s⁻¹). The observer covered approximately 10 m² of transect per minute. All flower
visiting and flying solitary bees and bumblebees within the transects were noted 
and determined to species, genera or taxonomic group (Extended Data Table 7), 
using the entomological collection at Lund University, and refs 38, 39 and 40. 
Bumblebees belonging to the B. terrestris complex, including B. terrestris, Bombus 
lucorum, Bombus magnus and Bombus cryptarum, could not be separated and 
were treated as one group (B. terrestris ag.). Flower cover was calculated based on 
measurements of the number and size of flowers within transects. 
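
The stated survey pace translates into an approximate time per transect. A small Python sketch of that arithmetic, using only the transect dimensions and the pace given above (the function name is illustrative):

```python
SURVEY_RATE_M2_PER_MIN = 10.0  # approximate pace stated in the text

def survey_minutes(width_m, length_m, rate=SURVEY_RATE_M2_PER_MIN):
    """Approximate time needed to walk one transect at the stated pace."""
    return width_m * length_m / rate

in_field_minutes = survey_minutes(2, 25)   # one 2 x 25 m in-field transect
border_minutes = survey_minutes(2, 300)    # one 2 x 300 m border transect
```

This gives roughly 5 minutes per in-field transect and about an hour per border transect.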

Solitary bee nesting. Three trap nests (CJ Wildlife), each containing 29 paper 
tubes with an inner diameter of 6 mm and nine O. bicornis (previously Osmia rufa) 
cocoons (four females and five males), in total 27 cocoons (12 females and 15 
males) were placed at each field approximately a week before the latest field within 
a pair was estimated to start flowering (equivalent to stage 55-63 on the BBCH 
scale, where stage 55 corresponds to individual buds being visible but still closed 
and stage 63 corresponds to the time when 30% of the flowers on the main raceme
have opened), between 10 and 24 June 2013 (Extended Data Fig. 1 and Extended
Data Table 2). After emergence from the cocoons, O. bicornis individuals mate and
the female starts to build cells where she places her eggs on collected pollen.
Emergence was the same in both treatments (Extended Data Fig. 1a). Females
prefer to return to and build cells in their natal nest, over new equivalent nest
cavities, and there is indication that nest site availability is limiting populations
in current agricultural landscapes.

The cocoons originated from the study region. We artificially delayed emer- 
gence by about a month, by storing cocoons at 2-5 °C, to match the phenology of 
the spring-sown oilseed rape. In our study region in southern Sweden, observa- 
tions of the species since 1990 indicate May (255 observations) to be the main 
activity period of O. bicornis, followed by April (94), June (83), July (2) and March
(1) (access date 9 February 2014, species: “Osmia bicornis”, region: “Götaland”,
period: “1990-2014”, “March”, “April”, “May”, “June”, “July” and “August”). 
This indicates that most of the O. bicornis at our study fields likely originated 
from the introduced population. Placement at the fields occurred on the same day 
at fields within a pair (Extended Data Table 2). Trap nests were mounted on poles 
in the field borders, approximately 50 m apart, facing southwards and with shel- 
tering vegetation at the northern side. 

Nesting tubes were collected 36-43 days after installing the cocoons. Nesting
activity was determined in October 2013 by counting the number of tubes with
brood cells. Where nesting activity occurred, O. bicornis built 4-34 brood cells in
total per field (3.5 ± 0.3 (mean ± s.e.m.) cells per nest and field), distributed over
1-9 tubes. Proportion emerging from the cocoons was determined by counting the 
number of open cocoons. The pupa was considered dead if the cocoon was intact 4 
weeks after placement at the fields. 
Bumblebee colonies. Six commercially reared Bombus terrestris colonies 
(Natupol N, Koppert Biological Systems) were placed at each field at the onset 
of oilseed rape flowering, between 14 and 28 June 2013. At this time, the colonies 
were approximately 10 weeks old and contained one queen, approximately 50 
workers and brood in all stages. Placement followed the phenology of the oilseed 
rape fields and was done on the same day in six of the pairs (or 2 days apart in one 
case) for fields within a pair (Extended Data Table 2). Placements of colonies at the 
two field pairs with asynchronous phenology were separated by 8 days between 
fields within the pairs, to follow the onset of flowering in the individual fields 
(Extended Data Table 2). Bumblebee colonies were ordered in four batches, with 
colonies from the same batch in the six synchronous pairs and from batches 
matching the individual fields for the two asynchronous pairs (Extended Data 
Table 2). Prevalence of pathogens and parasites in the colonies was not quantified
before placement, although commercial colonies can be infested, and this could
add unexplained noise to our data. Colonies were placed in triplets in two ventilated
houses, located in a shaded part of the field borders (Extended Data Fig. 1).
The colonies did not receive any supplementary feeding after placement at the 
fields. The inner plastic boxes and the B. terrestris colony content (bees, brood and 
nesting material) were weighed when placed at the fields and thereafter approxi- 
mately biweekly. Colonies were closed for exiting bees before weighing and each 
colony was weighed 3-5 times (including the initial weighing). Two colonies (one 
at each treatment) were not weighed at one occasion, because they could not be 
closed for exiting bees. All colonies within a field pair were terminated by freezing 
(−20 °C) at first sight of emerging new queens in any of the 12 colonies. This
happened between 7 July and 5 August 2013, or 23-38 days after the colonies had 
been placed at the fields. At the asynchronous field pairs, the colonies were col- 
lected at different dates from fields within the pair, but were allowed an equal 
number of days from placement to termination. 
The outer two colonies in each triplet box were examined to estimate the
number of queen and worker/male cocoons, weight of cocoons, larvae and nest
structure and the number of cells used for nectar and pollen storage. Separation
between queen and worker/male cocoons was based on the lowest value between
the peaks of the bimodal distribution of cocoon width, based on measurement of
all cocoons from four of the colonies (Extended Data Fig. 1c).
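
One way to locate the lowest value between the two peaks of a bimodal width distribution is sketched below in Python. This is an illustration under stated assumptions, not the authors' procedure: the 1 mm bin width, the function name `valley_cutoff` and the cocoon widths are all hypothetical.

```python
from collections import Counter

def valley_cutoff(widths_mm, bin_mm=1.0):
    """Lower edge of the least-populated bin between the two tallest peaks."""
    bins = Counter(int(w // bin_mm) for w in widths_mm)
    lo, hi = min(bins), max(bins)
    counts = [bins.get(b, 0) for b in range(lo, hi + 1)]
    # A peak is a non-empty bin higher than its left neighbour and at least
    # as high as its right neighbour (so a plateau counts once).
    peaks = [i for i, c in enumerate(counts)
             if c > 0
             and (i == 0 or c > counts[i - 1])
             and (i == len(counts) - 1 or c >= counts[i + 1])]
    if len(peaks) < 2:
        raise ValueError("distribution does not show two peaks")
    tallest_two = sorted(peaks, key=lambda i: counts[i], reverse=True)[:2]
    first, second = sorted(tallest_two)
    valley = min(range(first + 1, second), key=lambda i: counts[i])
    return (lo + valley) * bin_mm

# Hypothetical cocoon widths (mm): a worker/male mode and a queen mode
example_widths = ([8] + [9] * 4 + [10] * 3 + [11] * 2 + [12]
                  + [13] * 2 + [14] * 4 + [15] * 3 + [16])
cutoff = valley_cutoff(example_widths)
```

Cocoons at or above the returned width would then be classed as queens and narrower ones as workers/males, mirroring the rule described above.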

Honeybee colonies. Six equally sized Apis mellifera colonies were placed at each 
field (in total 96 colonies) at the onset of oilseed rape flowering, on 14-28 June 
2013 (Extended Data Table 2), containing an estimated 3,418 ± 123 (mean ±
s.e.m.) adult bees per colony (with no statistical difference between treatments
(Extended Data Table 5)). Placement at the fields followed their phenology and 
was done on the same day (or two days apart in one case) for fields in six of the 
pairs (Extended Data Table 2). At the two field pairs with asynchronous pheno- 
logy, placements were separated by seven days between fields within the pairs, 
following the onset of flowering in the individual fields (Extended Data Table 2). 

Honeybee colony strength (that is, number of adult bees per colony) was 
assessed before placement at the experimental fields, on 6-7 June, and again at 
a common over-wintering location after removal from the experimental fields, on 
29 July to 2 August, by a trained observer using the Liebefeld method. The
colonies were removed from the experimental fields on 2-31 July, at the end of 
oilseed rape flowering. 

The colonies were produced on 27-31 May by a professional beekeeper with 1-
or 2-year-old queens of known descent. Colonies were equalized to include two full
honeycombs (with bees), two combs with mainly sealed brood (with bees), one 
queen originating from the same colony as the one from which the split (newly 
created colony) was taken, bees from two combs shaken into the split, one drawn 
out empty comb and five combs with wax foundation. The queens in the splits 
were freely mated and derived from three different mother queens and consisted of 
four different groups based on queen lineage and age. Queen lineage and age were 
matched between fields within a pair, but the distribution of colonies was otherwise
randomized. The comb size was full Langstroth, with an area of 880 cm² per
comb side and an estimated 1.25 bees per cm² when a comb side was fully covered
(a total of 1,100 bees per side). Parent colonies and the new splits were placed in a
60 ha field of organically grown winter-sown oilseed rape after equalization and 
before placement at the 16 experimental fields, to minimize the risk of exposure to 
clothianidin and other pesticides. 
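
The comb-side figure above follows directly from the stated area and bee density. The Python sketch below reproduces that arithmetic and shows how a colony total could be built up from per-side coverage. The coverage fractions are hypothetical, and summing coverage fractions is a simplified illustration, since the text does not spell out the steps of the Liebefeld method.

```python
COMB_SIDE_AREA_CM2 = 880.0   # full Langstroth comb side, from the text
BEES_PER_CM2 = 1.25          # density on a fully covered side, from the text

BEES_PER_FULL_SIDE = COMB_SIDE_AREA_CM2 * BEES_PER_CM2  # 1,100 bees

def estimate_colony_strength(coverage_fractions):
    """Estimated adult bees from the covered fraction (0-1) of each comb side."""
    return sum(coverage_fractions) * BEES_PER_FULL_SIDE

# Hypothetical observer scores for four comb sides
example_strength = estimate_colony_strength([1.0, 0.5, 0.75, 0.25])
```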
Pollen samples. To verify the use of oilseed rape by the bees, pollen samples were 
taken from pollen traps mounted on the A. mellifera colonies, from B. terrestris 
foraging in the fields and from O. bicornis brood cells. The pollen traps were 
mounted on three A. mellifera colonies and were activated during the peak flowering
of the oilseed rape (stages 65-67 on the BBCH scale). At least 25 ml of pollen
was collected from each field. A subsample of 15.0 g of the A. mellifera-collected
pollen was sorted into separate samples based on colour and the separate samples
were weighed. One to five samples from B. terrestris were collected per field (2.9 ±
0.3 (mean ± s.e.m.)), giving a total of 47 samples. Pollen was collected, when
possible, from O. bicornis larval cells, resulting in 17 samples from the six control 
fields with nesting activity. 

50-500 random pollen grains per sample were determined to have originated
from either oilseed rape or another plant species using microscopy (10-40
magnification) and the pollen reference collection at Department of Biology, 
Lund University. 
Neonicotinoid residues. Vegetation, pollen and nectar samples were collected to 
quantify the concentrations of clothianidin, together with β-cyfluthrin and the
other four neonicotinoids used in Sweden (Extended Data Table 8), and to confirm 
the treatments. Samples of herbaceous material (flowers and leaves) were col- 
lected, within 2 days of sowing (7 April-20 May), every tenth metre in the transects 
used for wild bee monitoring in the permanent field borders adjacent to the oilseed 
rape fields. At the treated fields we also collected similar vegetation samples 13-15 
days after sowing (21 April-3 June). In each field, five A. mellifera with pollen loads 
were caught to collect pollen samples and at least five nectar foragers were caught 
to collect nectar from the honey stomach. At two of the control fields, no 
A. mellifera with pollen loads could be found in the oilseed rape fields. Five 
B. terrestris were caught in the flowering oilseed rape fields, brought to the labor- 
atory and nectar was extracted from the nectar stomachs of 3-5 bees per field, 
except at one control field where only one bee carried nectar. 

Nectar samples were quantitatively handled using the capillary microsampling
technique. Neonicotinoids were quantified using liquid chromatography
coupled with tandem mass spectrometry. β-Cyfluthrin was quantified using gas
chromatography coupled with mass spectrometry. See Extended Data Table 8 for
limits of detection and quantification. 

Observer blind data collection. The people monitoring wild bees in the oilseed 
rape fields, handling the solitary bee nests, weighing and examining the bumblebee 
colonies, assessing the honeybee colony strength, and collecting honeybee pollen
and nectar samples were blinded with respect to treatment. However, for practical
reasons it was not possible to blind the person collecting vegetation samples in field 
borders during sowing and thereafter monitoring wild bees in the border transects 
and collecting bumblebees for pollen and nectar samples. 

Statistical analyses. All data were analysed in SAS 9.4 for Windows (SAS Institute
Inc.).

Wild bee densities were compared between treatments and in relation to flower 
cover, size of the focal oilseed rape field and proportion of agricultural land in the 
surrounding landscape using a generalized linear mixed model (GLMM, SAS 
PROC GLIMMIX) with Poisson error distribution and log link. Pair identity, pair
identity × treatment and field part nested within pair identity × treatment were
included as random factors, to account for the pairing of sites and the hierarchical
study design. To investigate if the difference in phenology between fields influ- 
enced the difference in wild bee density between treatments, we also ran a statist- 
ical model only including temporally synchronous surveys, that is, surveys not 
more than 1 day apart for fields within a pair (Extended Data Table 2). In addition, 
to investigate if the influence of treatment was consistent for strictly wild bees, we 
ran another two models, but excluded B. terrestris ag., which could originate from 
the commercial colonies, and all bumblebees not determined to species (Extended 
Data Table 7). Results from all four analyses were qualitatively the same, except for 
flower cover, which did not relate significantly to the strictly wild bee density 
(Extended Data Table 4). GLMM with binomial error distribution and logit link 
were used to test the difference in flower cover between treatments, both for all 
data and for only temporally synchronous surveys (Extended Data Table 5). 
Results did not differ qualitatively depending on data set used (Extended Data 
Table 5). 
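
The wild bee density model described above can be written out as follows. This is a sketch of the model structure implied by the text, not a formula taken from the source: the index and symbol names are assumptions, with i denoting the field pair, j the treatment within a pair and k the field part.

```latex
\begin{aligned}
y_{ijk} &\sim \mathrm{Poisson}(\mu_{ijk}), \\
\log \mu_{ijk} &= \beta_0
  + \beta_1\,\mathrm{treatment}_{ij}
  + \beta_2\,\mathrm{flower\ cover}_{ijk}
  + \beta_3\,\mathrm{field\ size}_{ij}
  + \beta_4\,\mathrm{agricultural\ land}_{ij}
  + a_i + b_{ij} + c_{ijk}, \\
a_i &\sim N(0, \sigma_a^2), \quad
b_{ij} \sim N(0, \sigma_b^2), \quad
c_{ijk} \sim N(0, \sigma_c^2).
\end{aligned}
```

Here a_i, b_{ij} and c_{ijk} stand for the random effects of pair identity, pair identity × treatment and field part nested within pair identity × treatment, respectively.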

Differences in emergence of O. bicornis from the cocoons between treatments, 
sexes and their interaction were tested with a GLMM with binomial error distribution
and logit link. Pair identity, pair identity × treatment and sex nested within
pair identity × treatment were included as random factors. The number of
O. bicornis nest tubes with nesting activity was compared between treatments
using a Wilcoxon-Mann-Whitney test (SAS PROC NPAR1WAY).
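
For readers unfamiliar with the Wilcoxon-Mann-Whitney test, the statistic it is built on can be computed directly from pairwise comparisons. The Python sketch below computes the U statistic only, with no P value, and the tube counts are hypothetical rather than the study's data.

```python
def mann_whitney_u(x, y):
    """U statistic for sample x: pairs (a, b) with a > b, ties counted as 0.5."""
    return sum(1.0 if a > b else 0.5 if a == b else 0.0
               for a in x for b in y)

# Hypothetical numbers of nest tubes with brood cells per field
control_tubes = [3, 1, 9, 0, 4, 2, 0, 5]
treated_tubes = [0, 0, 0, 0, 0, 0, 0, 0]

u_control = mann_whitney_u(control_tubes, treated_tubes)
u_treated = mann_whitney_u(treated_tubes, control_tubes)
```

The two U values always sum to the product of the sample sizes. A value of U far from half that product indicates that one group tends to have larger counts than the other.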

An individual growth model (Extended Data Table 6) based on a linear mixed 
model (LMM, SAS PROC MIXED) was used to test the effect of treatments on
the weight gain of the B. terrestris colonies from placement at the fields (day = 0).
The net weight gain was related to day, treatment, day × treatment, day × day and
day × day × treatment. Random intercepts and random slopes for day and day ×
day were included, with the colony identity as the subject and an unstructured
covariance matrix. Pair identity and pair identity × treatment were included as
random factors to account for the study design. Since the individual growth model
was complex and yielded significant two- and three-way interactions involving
treatment, we decided to: (1) analyse growth over time separately for the two
treatments (Extended Data Table 6); and (2) test differences in colony growth rate
between treatments only for the positive growth phase, identified as the period 
until the peak weight at control fields, using a LMM with estimated slope as the 
dependent variable, treatment as the independent variable and pair identity as a 
random factor. LMM (with normal error distribution) or GLMM (with Poisson 
error distribution and log link) were used to compare the number of queen and 
worker/male cocoons, weight of cocoons, larvae and nest structure and the num- 
ber of cells used for nectar and pollen storage between treatments (Extended Data 
Table 5). 
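
The individual growth model described above corresponds to a quadratic growth curve with colony-level random coefficients. The equation below is a sketch of that structure under assumed notation (none of the symbols come from the source): w_c(d) is the net weight gain of colony c on day d, T_c indicates its treatment and p(c) its field pair.

```latex
\begin{aligned}
w_c(d) &= \beta_0 + \beta_1 d + \beta_2 T_c + \beta_3\, d\, T_c
        + \beta_4 d^2 + \beta_5\, d^2\, T_c \\
       &\quad + u_{0c} + u_{1c}\, d + u_{2c}\, d^2
        + a_{p(c)} + b_{p(c),T_c} + \varepsilon_c(d), \\
(u_{0c}, u_{1c}, u_{2c})^{\top} &\sim N(\mathbf{0}, \mathbf{G}),
\qquad \mathbf{G}\ \text{unstructured}.
\end{aligned}
```

The terms a and b stand for the random effects of pair identity and pair identity × treatment. The coefficients β3 and β5 carry the two- and three-way interactions with treatment mentioned in the text.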

Honeybee colony strength (that is, number of adult bees per colony) was com- 
pared between treatments using a LMM. Colony strength before placement at the 
fields was used as a covariate and pair identity and pair identity × treatment were
included as random factors. A colony that lost its queen during transport to the
field (treated field) and swarmed colonies (eight at control fields and ten at treated
fields) were excluded from the analysis (which did not qualitatively alter the
results). 

To investigate if the presence of other spring oilseed rape fields within 1 km 
influenced the results, B. terrestris colony growth (Extended Data Table 5), B. 
terrestris queen and worker/male production (Extended Data Table 5) and A. 
mellifera colony development (Extended Data Table 6) were analysed using the 
full data set as well as a data set where the two field pairs with other spring-sown 
oilseed rape within 1 km from one of the fields were excluded, since the other 
spring-sown oilseed rape fields were within the potential flight range of both bee
species. The results were qualitatively the same for B. terrestris colony growth,
weight of produced cocoons and A. mellifera colony development independent of 
including or excluding the two field pairs (Extended Data Tables 5 and 6), but 
differed for the number of B. terrestris cocoons (Extended Data Table 5). The latter 
could be a result of reduced statistical power to detect differences, since the level of 
replication is reduced from eight to six when excluding two of the field pairs and 
queen production in particular is documented to be very variable.
To investigate if the Biscaya used at one control field influenced the results, we
analysed wild bee density (Extended Data Table 4), O. bicornis nesting activity 
(results not shown), B. terrestris colony growth (Extended Data Table 5), B. ter- 
restris queen and worker/male production (Extended Data Table 5) and A. melli- 
fera colony development (Extended Data Table 6) in relation to the insecticide 
seed treatment both including and excluding the field pair where Biscaya was used 
at the control field. The results were qualitatively the same for all dependent 
variables independent of including or excluding the field pair (Extended Data 
Tables 4-6). 

The clothianidin concentrations in nectar and pollen collected by honeybees, 
nectar collected by bumblebees and field border plant material were analysed in 
relation to treatment using Wilcoxon-Mann-Whitney tests.

Denominator degrees of freedom in the mixed models were estimated with the
Kenward-Roger method or, when there was a negative covariance in the random
part of the model, the containment method (constraining the variance component
to 0), to avoid inflated denominator degrees of freedom. Deviance from the
assumption of normal error distribution of the LMM was tested using a
Shapiro-Wilk test and visually assessed on residual plots. When deviance was
detected (P < 0.05 and indicated in plots), data were either square-root transformed
or a GLMM, assuming Poisson error distributions, was used. Deviance
from the assumption of homogeneous variance between compared groups was
tested using Levene’s test. When deviance was detected (P < 0.05), heterogeneous
variance was modelled. Over-dispersion of the data, when the variance is considerably
larger than the mean, was assessed by the ratio of the generalized χ² statistic
and its degrees of freedom. If the ratio was larger than 1.3, an over-dispersion
parameter (random _residual_) was added to the model.
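
The over-dispersion check can be illustrated with a Pearson χ² ratio for a Poisson model. The Python sketch below is a simplification: it ignores random effects, so it only approximates the generalized χ² statistic reported by mixed-model software. The counts, the intercept-only fit and the function names are hypothetical, while the 1.3 threshold is the one stated above.

```python
OVERDISPERSION_THRESHOLD = 1.3  # threshold stated in the text

def pearson_dispersion_ratio(observed, fitted, n_parameters):
    """Pearson chi-square divided by residual degrees of freedom (Poisson)."""
    chi_square = sum((y - mu) ** 2 / mu for y, mu in zip(observed, fitted))
    residual_df = len(observed) - n_parameters
    return chi_square / residual_df

# Hypothetical bee counts and an intercept-only fit (fitted value = mean count)
counts = [0, 1, 2, 4, 9, 8]
mean_count = sum(counts) / len(counts)
ratio = pearson_dispersion_ratio(counts, [mean_count] * len(counts), n_parameters=1)
needs_overdispersion_term = ratio > OVERDISPERSION_THRESHOLD
```

For Poisson data the ratio should be close to 1. Values well above 1 indicate more variation than the Poisson distribution allows.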
Power analysis. We performed a power analysis for honeybee colony strength, to 
investigate the effect size that we could potentially detect given our design and 
replication. A power analysis is conditional on the study design and the statistical 
model used to analyse the data, so we therefore used a power analysis method
recommended for mixed models. With the macro (program) MixedTPower we
produced a power curve based on the honeybee colony strength model. We
assumed α = 0.05 and then calculated power for a range of effect sizes. The effect
size is initially expressed on the same scale as the dependent variable (that is, 
number of bees per colony; Extended Data Fig. 2a). By dividing the effect size 
with the average number of bees per colony at control sites, we obtained effect size 
expressed as the percentage change in the number of bees per colony (that is, 
colony strength) between control fields and treated fields (Extended Data Fig. 2b), 
which made it possible to compare our effect size with the effect sizes stated by the
European Food Safety Authority and the power analysis performed by the
Centre for Ecology and Hydrology. Our power analysis indicated that, given
our design, replication and data analysis method, we would be able to detect an
effect size of just below 20% with a power of 0.8 (Extended Data Fig. 2b). This is in
line with the estimated effect size for our level of replication given by the Centre for
Ecology and Hydrology.
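
The conversion from an absolute to a percentage effect size described above is a single division. A minimal Python sketch follows. The control mean and effect size are hypothetical values chosen for illustration, because the mean number of bees per colony at control fields is not given in this section.

```python
def percent_effect(effect_bees, control_mean_bees):
    """Effect size as a percentage of mean colony strength at control fields."""
    return 100.0 * effect_bees / control_mean_bees

# Hypothetical values: a 1,200-bee difference against a control mean of 6,000
example_percent = percent_effect(1200, 6000)
```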
Extended Data Figure 1 | O. bicornis emergence and B. terrestris colonies.
a, Mean (± 95% confidence limits) proportion emergence of O. bicornis from
cocoons in relation to treatment (control or insecticide seed coating), with
higher emergence for males than females (generalized linear mixed model,
binomial error distribution, logit link; F₁,₁₄ = 14.97, P = 0.0017), no difference
between treatments (F₁,₇ = 0.71, P = 0.43) and no interaction (F₁,₁₄ = 0.01,
P = 0.94). n = 8 fields per treatment, with 12 female and 15 male cocoons at
each field. Photos (with permission; Morgan Boch): left, emerged O. bicornis
cocoon; right, O. bicornis female at a trap nest filled with cardboard nest tubes.
b, Mean (± 95% confidence limits) weight of B. terrestris colonies at placement
at the fields in relation to treatment (linear mixed model, F₁,₇ = 0.99, P = 0.35).
n = 8 fields per treatment, with six colonies at each field. Photos (M.R.): left,
B. terrestris worker foraging in the oilseed rape; right, house containing three
B. terrestris colonies. Means and confidence limits in panels a and b are based
on back-transformed, model-estimated least square means. c, B. terrestris silk
cocoon width distribution of all cocoons in four colonies (two from two
different control fields and two from two different fields with insecticide seed
treatment) initially examined to separate between queen and worker/male
cocoons. Dashed vertical line indicates selected cut-off width at 12 mm (the
lowest value between the two peaks), with queens larger (or equal) and workers/
males smaller. Photo (M.R.): B. terrestris colony under examination.


©2015 Macmillan Publishers Limited. All rights reserved 


Axes: power (y) against effect size in bees per colony (x).


Extended Data Figure 2 | Power curves for honeybee colony strength. 

a, b, Relationship between power and effect size estimated for the honeybee 
model (Extended Data Table 6), with effect size expressed as the difference in 
honeybee colony strength (number of bees per colony) (a) and the 
percentage change in colony strength (b) between colonies at control fields
and at fields with insecticide seed coating after placement at the oilseed rape 
fields. Grey reference lines indicate a power of 0.8 and the corresponding 
effect size. 
Extended Data Table 1 | 2013 field size and 2011 and 2013 land use in the landscapes surrounding (radius = 2 km) the oilseed rape


Columns: Control (n = 8), mean ± s.e.m. and min-max; Insecticide seed coating (n = 8), mean ± s.e.m. and min-max; Test of difference between treatments (F, P); Correlation matrix.


Size of focal oilseed rape 


field (ha) 94426 4.0-27.0 8440.9 4.0-11.0 0.1417 0.75 0.102 0.252 -0.425 0.033 -0.130 -0.174 -0.048 0.543 -0.159 0.300 
Agricultural land (%) 58.24 10.69.5882 55.8498 59-833 0.297 0.1 0.923 -0.049 0.831 0.401 0.309 0.539 0.744 -0.962 -0.083 
ag loa 38.8296 3.0-70.9 343488 03-745 0.64,, 0.45 0.338 0.592 0.173 0.334 0.675 0.870 -0.866 -0.069 
Semi-natural grassland (%) 3.1410 02-74 41412 0294 0.16,; 0.70 0.381 0.450 -0.259 -0.285 -0.337 0.082 -0.096 
rot gas a 14.2419 35-185 149423 3.2257 0.1417 0.75 0.688 0.157 0.139 0.462 -0.827 0.009 
Maize cultivation 2011 (%) 1440.5 039 17408 065 0.027 0.88 0.272 -0.243 0.107 -0.483 0.203 
Maize cultivation 2013 (%) 1340.4 036 1340.7 056 0.42;7 0.54 

ie cllseedrape —gg407 057 06402 0-15 0.05,7 0.83 0.137 0.785 -0.254 -0.217 
Spring sown oilseed rape 

2013 (%)—includingthe 1840.7 0362 1.5404 0327 <0.01,7 0.98 

focal field 

birt oilseed rape i 4408 068 1641.0 082 <0.01,7 0.96 0.566 -0.455 -0.163 
Winter sown oilseed rape 

ani (i) 15407 052 25412 086 0.34,, 0.58 

oe 2013, g2428 03236 75421 08-178 0.0117 0.93 

Forest (%) 25.3410.618-74.8 24.0486 05672 0.23.7 0.64 -0.693 0.116 
Urban (% 27411 086 3341.0 090 0.5317 0.49 0.026 


*Mass-flowering crops include oilseed rape (46%), potato (28%), pea (18%), bean (4%), fruit and berry cultivation (4%), and herbs and seeds (<1%). 
Extended Data Table 2 | Phenology (date, BBCH and flower cover) in the oilseed rape fields and delivery, placement and survey* of bees

Pair | Seed treatment† | Sowing date | Date placement Osmia bicornis (oilseed rape growth stage (BBCH)) | Bombus terrestris delivery date (batch) | Date placement Bombus terrestris (oilseed rape growth stage (BBCH)) | Date placement Apis mellifera (oilseed rape growth stage (BBCH)) | Date wild bee survey border (oilseed rape growth stage (BBCH)) | Date wild bee survey field 1 (% flower cover) | Date wild bee survey field 2 (% flower cover)
P01 | contr | 23 April 2013 | 13 June (59) | 18 June (2) | 20 June (65) | 19 June (65) | 25 June (65) | 1 July (52) | 3 July (43)
P01 | treat | 28 April 2013 | 13 June (57) | 18 June (2) | 20 June (61) | 19 June (61) | 26 June (63) | 1 July (95) | 3 July (97)
P02 | contr | 7-8 May 2013‡ | 13 June (50) | 20 June (3) | 26 June (63) | 25 June (63) | 6 July (65) | 28 June (58) | 9 July (60)
P02 | treat | 21 April 2013‡ | 13 June (61) | 18 June (2) | 18 June (63) | 18 June (63) | 20 June (63) | 19 June (90) | 27 June (49)
P03 | contr | 18 May 2013 | 24 June (52) | 25 June (4) | 28 June (60) | 2 July (61) | 8 July (63) | 12 July (33) | 16 July (46)
P03 | treat | 11 May 2013 | 24 June (57) | 25 June (4) | 28 June (61) | 2 July (63) | 8 July (65) | 8 July (53) | 12 July (64)
P04 | contr | 6 May 2013‡ | 13 June (50) | 20 June (3) | 26 June (65) | 25 June (65) | 4 July (65) | 7 July (56) | 9 July (61)
P04 | treat | 21 April 2013‡ | 13 June (61) | 18 June (2) | 18 June (63) | 18 June (63) | 20 June (65) | 19 June (89) | 1 July (37)
P05 | contr | 29 April 2013 | 15 June (57) | 18 June (2) | 20 June (63) | 20 June (63) | 24 June (65) | 24 June (21) | 4 July (39)
P05 | treat | 25 April 2013 | 15 June (61) | 18 June (2) | 18 June (63) | 18 June (63) | 24 June (65) | 24 June (57) | 4 July (100)
P06 | contr | 1 May 2013 | 13 June (57) | 18 June (2) | 19 June (63) | 19 June (63) | 28 June (65) | 2 July (74) | 5 July (94)
P06 | treat | 25-26 April 2013 | 13 June (53) | 18 June (2) | 19 June (63) | 19 June (63) | 28 June (65) | 5 July (89) | 9 July (81)
P07 | contr | 4 May 2013 | 15 June (55) | 20 June (3) | 24 June (63) | 24 June (63) | 1 July (65) | 7 July (26) | 11 July (33)
P07 | treat | 2 May 2013 | 15 June (57) | 20 June (3) | 24 June (64) | 24 June (64) | 1 July (65) | 7 July (87) | 11 July (39)
P08 | contr | 6 April 2013 | 10 June (61) | 11 June (1) | 14 June (65) | 14 June (65) | 17 June (65) | 18 June (43) | 28 June (5)
P08 | treat | 16 April 2013 | 10 June (61) | 11 June (1) | 14 June (63) | 14 June (63) | 18 June (65) | 18 June (14) | 28 June (72)

*Shaded numbers are surveys selected for analysis of wild bee density data collected at the same time (that is, within subsequent days) within the field pairs.
†contr, control; treat, insecticide seed coating.
‡Highly asynchronous phenology of the fields within the pair.
Extended Data Table 3 | Use of plant protection products in the oilseed rape fields during the 2013 growing season 


Pair 
P01 
P01 
P02 
P02 
P03 
P03 
P04 
P04 
P05 
P05 
P06 
P06 
P07 
P07 
P08 
P08 


Seed 
treatment* 


contr 
treat 
contr 
treat 
contr 
treat 
contr 
treat 
contr 
treat 
contr 
treat 
contr 
treat 
contr 
treat 


*contr, control; treat, insecticide seed coating. 


Date treatment 1   Compound treatment 1 


04 June 2013 Mavrik 

06 June 2013 Plenum 
31 May 2013 Plenum 
04 June 2014 Plenum 


no treatment 


12 June 2013 Avaunt 
16 June 2013 Avaunt 
07 June 2013 Plenum 
12 June 2013 Plenum 
30 May 2013 Plenum 
12 June 2013 Biscaya 
07 June 2013 Avaunt 
04 June 2013 Avaunt 
31 May 2013 Plenum 
30 May 2013 Avaunt 
04 June 2014 Plenum 


Dose treatment 1 


0.25 l/ha 
150 g/ha 
160 g/ha 
150 g/ha 


170 g/ha 
160 g/ha 
150 g/ha 
150 g/ha 
150 g/ha 
0.30 l/ha 
170 g/ha 
170 g/ha 
150 g/ha 
170 g/ha 
150 g/ha 


Date treatment 2   Compound treatment 2 


15 June 2013 Steward 
10 June 2013 Mavrik 
10 June 2013 Steward 


19 June 2013 Mavrik 


08 June 2013 Plenum 


14 June 2013 Avaunt 
Dose treatment 2 
85 g/ha 


0.20 l/ha 
85 g/ha 


0.25 l/ha 


150 g/ha 


120 g/ha 
Extended Data Table 4 | Wild bee density in oilseed rape fields and borders in relation to insecticide seed treatment and covariates 


Explanatory variable   Estimate   Degrees of freedom   F   P 

Wild bees (all data) 
Intercept 2.55 
Treatment 0.73 1,7 9.68 0.019 
Flower cover 1.06 1,24 18.57 <0.001 
Field size 0.07 1,7 7.46 0.028 
Proportion agricultural land -1.20 1,8 2.35 0.16 

Wild bees (synchronized data*) 
Intercept 2.03 
Treatment 0.76 1,6 6.69 0.043 
Flower cover 1.32 1,29 26.56 <0.001 
Field size 0.08 1,7 6.46 0.038 
Proportion agricultural land -1.00 1,5 2.76 0.15 

Wild bees excluding Bombus terrestris ag. (all data) 
Intercept 0.79 
Treatment 1.14 1,7 12.65 0.0096 
Flower cover 1.06 1,17 8.52 0.094 
Field size 0.08 1,6 6.63 0.045 
Proportion agricultural land -0.33 1,7 0.20 0.67 

Wild bees excluding Bombus terrestris ag. (synchronized data*) 
Intercept -16.07 
Treatment 9.16 1,4 12.28 0.025 
Flower cover 2.17 [illegible] 0.35 0.57 
Field size 1.77 [illegible] 54.65 <0.001 
Proportion agricultural land 4.86 13.7 1.07 0.34 

Wild bees (excluding the field pair where Biscaya was used at the control field) 
Intercept 0.93 
Treatment 0.95 1,3 20.20 0.023 
Flower cover 1.18 1,15 16.29 0.0011 
Field size 0.20 1,4 10.04 0.034 
Proportion agricultural land -0.42 1,8 0.12 0.74 


*See Extended Data Table 2 for identification of synchronized data. 
Extended Data Table 5 | Statistical tests and mean values for bee-related variables in relation to the insecticide seed treatment in the oilseed rape fields 


Dependent variable   Degrees of freedom   F   P   Control (mean ± s.e.m.)   Insecticide seed coating (mean ± s.e.m.) 

Flower cover (%) - all data 1,7 9.34 0.018 46.4 ± 7.3 70.2 ± 6.5 
Flower cover (%) - synchronized data* 1,6 8.28 0.028 41.4 ± 9.0 70.9 ± 8.0 
Initial Bombus terrestris colony weight (g) 1,7 0.99 0.35 733.2 ± 17.8 722.7 ± 18.6 
Slope of Bombus terrestris colony growth 10 115.80 <0.001 21.3 ± 1.6 0.4 ± 1.6 
Slope of Bombus terrestris colony growth - excluding the two field pairs with other spring sown oilseed rape field within 1 km 1,5 143.02 <0.001 18.9 ± 1.1 -0.5 ± 1.1 
Slope of Bombus terrestris colony growth - excluding the field pair where Biscaya was used at the control field 1,6 108.41 <0.001 22.2 ± 1.7 0.5 ± 1.7 
Number of Bombus terrestris queen cocoons 1,7 7.78 0.027 70.0 ± 12.3 20.6 ± 8.3 
Number of queen cocoons - excluding the two field pairs with other spring sown oilseed rape field within 1 km 1,5 3.82 0.11 59.7 ± 15.8 22.0 ± 9.6 
Number of queen cocoons - excluding the field pair where Biscaya was used at the control field 1,6 9.46 0.022 69.1 ± 13.7 18.1 ± 7.0 
Number of Bombus terrestris worker/male cocoons 1,7 8.09 0.025 241.0 ± 29.8 142.0 ± 29.8 
Number of worker/male cocoons - excluding the two field pairs with other spring sown oilseed rape field within 1 km 1,5 6.57 0.050 206.1 ± 28.3 115.6 ± 20.7 
Number of worker/male cocoons - excluding the field pair where Biscaya was used at the control field 1,6 6.74 0.041 247.6 ± 33.9 144.0 ± 33.9 
Weight of Bombus terrestris cocoons (g) 1,7 14.77 0.0061 172.0 ± 32.3 54.0 ± 18.7 
Weight of cocoons (g) - excluding the two field pairs with other spring sown oilseed rape field within 1 km 1,5 12.34 0.017 135.1 ± 25.3 41.6 ± 14.5 
Weight of cocoons (g) - excluding the field pair where Biscaya was used at the control field 1,6 9.62 0.021 201.1 ± 32.3 69.2 ± 32.3 
Weight of Bombus terrestris larvae (g) 1,7 0.15 0.71 15.5 ± 6.0 13.6 ± 5.7 
Weight of Bombus terrestris nest structure (g) 1,7 12.34 0.0098 261.0 ± 24.7 139.4 ± 24.7 
Number of nectar cells 1,7 2.43 0.16 59.4 ± 23.7 23.5 ± 10.4 
Number of pollen cells 1,7 0.60 0.46 [?].5 ± 2.1 3.6 ± 1.4 
Initial number of Apis mellifera per colony 1,7 0.12 0.74 3412 ± 192 3325 ± 160 
Proportion oilseed rape pollen from Osmia bicornis (%) 35.1 ± 17.0 
Proportion oilseed rape pollen from Bombus terrestris (%) 1,8 3.70 0.092 88.1 ± 5.0 74.9 ± 7.7 
Proportion oilseed rape pollen from Apis mellifera (%) 1,7 1.09 0.33 52.6 ± 7.2 63.1 ± 6.9 


*See Extended Data Table 2 for identification of synchronized data. 
Extended Data Table 6 | Bumblebee colony growth (net weight gain) and honeybee colony strength (adult bees per hive) in relation to insecticide seed treatment 


Model   Explanatory variable(s)   Estimate   Degrees of freedom 


B. terrestris colony growth 

All fields 
Intercept -51.07 
Treatment -434.27 
Day 0.23 
Day × treatment 72.50 
Day × day 0.08 
Day × day × treatment -1.40 

Only control fields 
Intercept -533.40 
Day 77.59 
Day × day -1.44 

Only fields with insecticide seed coating 
Intercept -36.53 
Day -1.61 
Day × day 0.13 


A. mellifera colony strength 

All fields 
Intercept 9834.46 
Initial colony strength -0.19 
Treatment -41.51 

Excluding the two field pairs with other spring sown oilseed rape field within 1 km 
Intercept 9609.95 
Initial colony strength -0.18 
Treatment 199.73 

Excluding the field pair where Biscaya was used at the control field 
Intercept 9715.31 
Initial colony strength -0.16 
Treatment 90.68 
1,18 


0.92 
10.78 


1.67 
0.01 


1.33 
0.11 


0.82 
0.02 


0.35 
0.0055 


0.20 
0.94 


0.26 
0.76 


0.37 
0.88 
Extended Data Table 7 | Number of individuals of wild bee species or groups at control (n = 8) and insecticide-treated (n = 8) oilseed rape fields 


Group   Bee species   Control   Insecticide seed coating 

solitary bee Andrena sp. 15 25 
solitary bee Colletes sp. 2 
solitary bee Hylaeus sp. 1 0 
solitary bee Lasioglossum/Halictus sp. 10 3 
solitary bee Macropis europaea 1 0 
solitary bee Nomada sp. 1 3 
solitary bee Sphecodes sp. 4 1 
solitary bee unidentified solitary bee (not including Osmia bicornis ) 10 0 
bumble bee Bombus hortorum 3 0 
bumble bee Bombus hypnorum 10 5 
bumble bee Bombus lapidarius 275 43 
bumble bee Bombus pascuorum 18 6 
bumble bee Bombus pratorum 3 6 
bumble bee Bombus ruderarius 2 2 
bumble bee Bombus soroeensis 1 0 
bumble bee Bombus subterraneus 1 0 
bumble bee Bombus sylvarum 2 0 
bumble bee Bombus terrestris/lucorum/magnus/cryptarum 712 403 
bumble bee unidentified bumble bee 190 159 
Extended Data Table 8 | Residues of neonicotinoids (n) and a pyrethroid (p) in bee-collected pollen and nectar from control fields and fields sown with insecticide treated seeds 


Columns, in order: Control (n = 8 fields*), detected in n samples and highest concentration; Insecticide seed coating (n = 8 fields), detected in n samples and highest concentration; LOD†; LOQ† 

Honey bee pollen (ng/g) 
Acetamiprid (n) 1 0.34 0 0.080 0.24 
Clothianidin (n) 0 8 23 0.50 1.5 
Imidacloprid (n) 1 0.23‡ 0 0.30 0.90 
Thiacloprid (n) 3 1.4§ 4 0.29 0.070 0.21 
Thiamethoxam (n) 0 0 0.10 0.30 
Beta-cyfluthrin (p) 0 1.0 
Honey bee nectar (ng/ml) 
Acetamiprid (n) 0 0 0.033 0.10 
Clothianidin (n) 2 0.61 8 16 0.17 0.50 
Imidacloprid (n) 3 0.35 0 0.17 0.50 
Thiacloprid (n) 2 0.35§ 2 0.044 0.033 0.10 
Thiamethoxam (n) 1 0.19 0 0.17 0.50 


*n = 6 for pollen collected by honeybees at control fields, because no such bees with pollen could be found at two fields. 

† LOD, limit of detection; LOQ, limit of quantification. 

Pollen LOD and LOQ were estimated from spiking experiments of the average sample weight of 0.056 g. 

Nectar LOD and LOQ were estimated for the 0.016 ml sample volume. 

‡ Sample weight of 0.091 g explains reported value slightly below the estimated limit of detection, based on a 0.056 g sample weight. 
§ One oilseed rape field sprayed with Biscaya (12 June 2013), where thiacloprid is the active ingredient (Extended Data Table 3). 
Differential DNA mismatch repair underlies 
mutation rate variation across the human genome 


Fran Supek1,2,3 & Ben Lehner1,2,4 


Cancer genome sequencing has revealed considerable variation in 
somatic mutation rates across the human genome, with mutation 
rates elevated in heterochromatic late replicating regions and reduced 
in early replicating euchromatin. Multiple mechanisms have been 
suggested to underlie this, but the actual cause is unknown. Here 
we identify variable DNA mismatch repair (MMR) as the basis of this 
variation. Analysing ~17 million single-nucleotide variants from the 
genomes of 652 tumours, we show that regional autosomal mutation 
rates at megabase resolution are largely stable across cancer types, 
with differences related to changes in replication timing and gene 
expression. However, mutations arising after the inactivation of 
MMR are no longer enriched in late replicating heterochromatin relative 
to early replicating euchromatin. Thus, differential DNA repair 
and not differential mutation supply is the primary cause of the large- 
scale regional mutation rate variation across the human genome. 

We examined 1 megabase (Mb) mutation densities along 652 fully 
sequenced human cancer genomes with >3,000 single-nucleotide 
variants (SNVs) per genome, originating from 16 tissues. This threshold 
enables more robust estimates of regional SNV densities in the examined 
samples, but it excludes cancer types with a very low mutation burden 
(Methods). Despite vastly different mutational loads between tissues of 
origin and between individual tumours, the relative regional densities 
were, overall, consistent between samples. In a principal components 
analysis (PCA), the first principal component (PC1) corresponds closely 
to the average densities over all samples (R² = 0.99) and captures 86.2% 
of the non-random variability between the 1 Mb windows (Fig. 1a-c). 
This estimate of baseline variability per PC (Methods) encompasses the 
non-biological sources of randomness in the data (for example, low mutation 
counts per bin in some cancer types) but it may also include genuine 
biological variability, if it is particular to individual tumour genomes. 
The second most prominent PCA trend (PC2, 5.9% variability; Fig. 1a, 
d) precisely captures the known hypermutation of the X chromosome 
in a subset of tumours. Across the 652 tumours, we estimate that a 
further 7.9% of non-random variability exists that is not explained by 
the general pattern of regional rates or by the hypermutation of X (in 
PC3-8, with 4.4% in PC3 only; Fig. 1a, b). 
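
The 'broken stick' baseline named in the Fig. 1 caption can be illustrated with a short sketch. This is a minimal example under stated assumptions, not the authors' code: the function names are hypothetical, the explained-variance fractions are made-up numbers, and expressing each component's excess as a share of the total excess is only one plausible reading of "percentage above baseline".

```python
def broken_stick(p):
    """Expected variance fraction of each of p components under the
    broken-stick null model: b_k = (1/p) * sum_{i=k..p} 1/i."""
    return [sum(1.0 / i for i in range(k, p + 1)) / p for k in range(1, p + 1)]


def excess_over_baseline(explained, baseline):
    """Share (%) of the above-baseline variance carried by each component.
    Components at or below their baseline contribute zero."""
    excess = [max(e - b, 0.0) for e, b in zip(explained, baseline)]
    total = sum(excess)
    return [100.0 * x / total if total > 0 else 0.0 for x in excess]


# Hypothetical explained-variance fractions for four components (sum to 1).
explained = [0.70, 0.15, 0.10, 0.05]
baseline = broken_stick(len(explained))
share = excess_over_baseline(explained, baseline)
```

In this toy input only the first component exceeds its broken-stick expectation, so it carries all of the above-baseline variance.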

This signal can, in part, be ascribed to tissue-specific mutation rates: 
cancer samples from 7/16 tissues were significantly shifted in the distri- 
bution of their PC3 loadings (Mann-Whitney test, false discovery rate 
(FDR) < 1%). Liver, colorectal and B-lymphocyte tumours were differ- 
entiated by lower PC3 loading scores, whereas melanoma, breast, ovarian 
and lung tumours had higher PC3 loadings (tissues highlighted in Fig. 1e). 
To examine whether these significant regional rate changes are associated with 
DNA replication timing changes, we used Repli-Seq data from ENCODE 
cell lines as a proxy for the changes occurring in the cancers of the 
corresponding tissues. In all cases, the correlation of the cell-line-specific 
Repli-Seq signal to the cancer-type-specific 1 Mb mutation rates was most 
prominent in the cancer type matching the cell line, with a significant 
difference from the non-matching cancer types (FDR < 10%, Fig. 1f). 
Similarly, the changes in average gene expression levels in 1 Mb windows 
in tumour samples paralleled the changes in mutation rates in the same 


samples, with significantly stronger correlations for the matching cancer 
type (FDR < 10%, Fig. 1g; example in Extended Data Fig. 1a, b). 

Visualizing the cancer samples in a PC plot revealed a group of 
outliers in a low-density area of the plot with extreme PC3 and PC4 loadings 
(box in Fig. 1e). These samples derive almost exclusively from 
colorectal, stomach or uterine cancers, even though most samples from 
these three cancer types clustered elsewhere on the PC plot. One feature 
particular to tumours from these three tissues is that they frequently 
display inactivation of the MMR pathway through mutation of MMR 
genes or hypermethylation of the MLH1 gene promoter. Inactivation 
of MMR results in a high incidence of small insertions and dele- 
tions (indels) at simple sequence repeats—referred to as microsatellite 
instability (MSI) and used to identify MMR deficiency phenotypically— 
but also in an increased SNV load (Extended Data Fig. 1c). 

The outlier colon, gastric and uterine cancer samples on the PC plot 
(Fig. 1e and Extended Data Fig. 1d) were almost always MSI samples 
(phenotypically scored as MSI-high (MSI-H)), suggesting MMR defi- 
ciency as the cause of their unusual regional mutation rate variation. 

To understand how the regional mutation rate differs in these sam- 
ples, we plotted their mutation densities along the genome. The regional 
rate variation was substantially decreased in these tumours compared 
with other microsatellite-stable (MSS) tumours from the same tissues 
(Fig. 2a-c). The change towards more homogeneous mutation rates in 
MSI cancers was significant across many chromosomal regions, with 
~72% of the genome being affected (1,977/2,748 1 Mb windows, Mann-Whitney 
test on pooled tissues, 10% FDR; examples in Fig. 2d and 
Extended Data Fig. 1e, f). Consistently, the regional rates in MSI cancers 
from all three tissues correlated poorly with replication timing 
(Fig. 2e-g), gene expression levels and heterochromatin (Extended Data 
Fig. 2a-f). Moreover, the slopes of the regression lines between binned 
replication timing and SNV rates revealed consistent changes in MSI 
cancer samples when calculated separately for intergenic and for genic 
(intronic) DNA in the whole genome sequences (Fig. 2h, i). Very sim- 
ilar trends were observed in a broader set of 950 exome sequences of 
colorectal, uterine and gastric cancers (Extended Data Fig. 2g-j). 
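
The binned comparison of mutation rates with replication timing described here and in the Fig. 2 caption can be sketched as follows. This is an illustrative sketch, not the authors' pipeline: all function names are hypothetical, the bins are indexed 0 to 4 in order of increasing timing signal, and an ordinary least-squares line through the per-bin medians is assumed for the slope.

```python
from statistics import median


def relative_rates(densities):
    """Divide each window's mutation density by the mean over all windows."""
    mean = sum(densities) / len(densities)
    return [d / mean for d in densities]


def equal_frequency_bins(values, n_bins=5):
    """Give each value a bin index 0..n_bins-1 so that bins hold
    (nearly) equal numbers of windows, ordered by increasing value."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        bins[i] = rank * n_bins // len(values)
    return bins


def binned_slope(timing, rel_rates, n_bins=5):
    """Median relative rate per replication-timing bin, plus the
    least-squares slope of those medians against the bin index."""
    bins = equal_frequency_bins(timing, n_bins)
    meds = [median([r for r, b in zip(rel_rates, bins) if b == k])
            for k in range(n_bins)]
    xs = list(range(n_bins))
    x_bar = sum(xs) / n_bins
    y_bar = sum(meds) / n_bins
    slope = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, meds))
             / sum((x - x_bar) ** 2 for x in xs))
    return meds, slope
```

A slope near zero corresponds to the flat landscape reported for MSI samples; a slope far from zero corresponds to the replication-timing dependence seen in MSS samples.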

High mutation rates in uterine and colorectal cancers can also be 
caused by inactivation of the proofreading domain of DNA polymerase 
ε (PolE). Proofreading is a result of a 3'-5' exonuclease activity 
that enhances the accuracy of PolE by excising incorrectly placed nucle- 
otides during synthesis. MSS PolE tumours exhibited a significantly 
larger spread of the regional SNV density distribution than MSI tumours 
(Fig. 2a, b), even though their mutational load is typically higher 
(Extended Data Fig. 1c). Similar conclusions are reached with stomach 
cancer hypermutators of unknown aetiology (Fig. 2c). Thus, increased 
mutation supply does not explain the loss of regional mutation rate 
variability in MSI cancers. 

The relative frequencies of 5’ and 3’ contexts of different SNVs (the 
mutation spectrum) are informative of the mutational processes operative 
in a particular cancer type. We observed the previously reported 
signatures of MMR deficiency in MSI cancers: C > T transitions in 
an NpCpG sequence context and C > A transversions at CpCpC (all 


1EMBL-CRG Systems Biology Unit, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain. 2Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain. 3Division of Electronics, Rudjer Boskovic 
Institute, 10000 Zagreb, Croatia. 4Institució Catalana de Recerca i Estudis Avançats (ICREA), 08010 Barcelona, Spain. 
Figure 1 | Changes in megabase-scale regional mutation rate variation 
between tumour samples. a—e, PCA of the 1 Mb regional rates of 652 
whole-genome sequences. a, Amount of variance conveyed by the 

prominent PCs. Baseline estimated by ‘broken stick’ method (Methods). 

b, Same as in a but expressed as percentage above baseline (putatively non- 
noise) variance. c, PC1 reflects average rates. d, PC2 captures the variability in 
chromosome X mutation rates. a.u., arbitrary units; Avg, average; Chr, 
chromosome. e, Tumour sample loadings on PC3/4, highlighting cancer types 
significantly shifted by PC3 (Mann-Whitney test, FDR < 1%), as well as 
stomach (STAD) and uterine (UCEC) cancer. Dashed box denotes outlying 
samples. f, Pearson correlations of the tissue specificities (TS; Methods) of 
the Repli-Seq signal in cell lines to the tissue specificities of the 1 Mb mutation 
rates in cancer types with significant PC3 shifts. q is the significance of the 
difference of the matching versus non-matching cancer types (Z-test, FDR 
corrected). g, Same as in f but for tissue specificities of gene expression in 
tumour samples. 


mutations considered strand symmetrically). In addition, we report a 
general increase in the relative frequency of transitions in MSI genomes, 
wherein A > G increases preferentially when preceded or followed by a 
C, and C > T clearly increases most in the GpCpN context (Extended 
Data Fig. 3a). 

We examined how the different mutation spectra are distributed 
across the genome in MSI samples. The signatures most characteristic 
of MMR deficiency had a considerably flatter distribution in MSI tumours 
than in MSS or PolE-mutated tumours, whereas this was less the case for 
signatures not associated with MMR deficiency (Fig. 3a, b and Extended 
Data Fig. 3b, c). Indeed, the more abundant a mutational context becomes 
Figure 2 | Reduced regional mutation rate variability in genomes of MSI 
cancer samples. a—c, Decreased variance between mutation rates of 1 Mb 
windows in MSI samples, when compared to MSS samples (including MSI-L) 
or to ultramutated PolE/MSS samples. MSI/PolE samples are in the MSI group. 
In the case of STAD, comparison is to PolE wild-type hypermutators (hyper). 
Data points in distributions are medians of relative mutation frequencies of 
each 1 Mb window across all cancer samples in group. **P ≤ 0.01 by F-test for 
decrease in variance. Number of tumour samples given in brackets. o, standard 
deviation (σ). d, Relative SNV frequencies across 1 Mb windows of chromosome 
1p in colorectal cancer (CRAD). Unbroken and dotted lines are the median 
across tumour samples and its 95% confidence interval, respectively. For each 
tumour sample, relative mutation frequencies are obtained by dividing by 
the mean of all 1 Mb windows. *FDR ≤ 10% for rates significantly closer to 
unity in MSI-H samples (Mann-Whitney test). Striped bars are low- 
alignability regions (Methods). e-g, Reduced correlation of regional mutation 
rates to replication timing in MSI cancer samples. Genomic 1 Mb windows 
were pooled into five equal-frequency bins by the median Repli-Seq signal over 
11 cell lines. For each bin, median and interquartile range of relative mutation 
rates across 1 Mb windows are shown. R² values are on original (not binned) 
data. *P < 0.01 for a difference of R, after Fisher Z-transform. Prior to binning, 
cancer samples in a group were combined by taking the median of the 
relative mutation frequencies in each 1 Mb window (as shown in d). h, i, Same 
as e-g but examined separately for genic (intronic) and intergenic regions. a is 
the slope of the regression line fit to binned data. 


specifically in MSI tumours, the more uniformly it is distributed with 
respect to replication timing in MSI samples (Fig. 3c, R² = 0.45, P< 107°) 
but not in MSS samples (Extended Data Fig. 3d, R² = 0.01). 

Notably, however, many signatures not associated with MMR defi- 
ciency do flatten to some extent in MSI tumours. This suggests that the 
residual correlation with replication timing in MSI cancers (Fig. 2e-g) 
might derive purely from the mutations originating before MMR inac- 
tivation. To test this idea, we used the proportion of mutations in MSI- 
associated contexts to sort the MSI samples by the proportion of their 
history spent in an MMR-deficient state (Methods). The proportion of 
mutations in MSI-associated contexts significantly correlates with how 
Figure 3 | Association of mutational signatures to MSI and to 

replication timing. a, b, Relative mutation rates of example MSI-associated 
(a) or non-MSI-associated (b) contexts across genomic replication timing 
bins. Dotted lines are linear fits to the bins with slope a, a measure of 
association to replication timing. c, Association between MSI propensity 

of a mutational context (log2 ratio of its frequency in MSI versus MSS 
tumours) to its replication timing slope in MSI tumours. Ts, transition; Tv, 
transversion. d, e, Association of percentage MSI-specific signatures 

(CCN > CAN + GCN > GTN + [C/T]AN > [C/T]GN) in an MSI tumour 
sample and the binned replication timing slopes for all contexts (d), or for 
various non-MSI transversions (e) in the same tumour sample. Repli-Seq slopes 
are averaged over those mutational contexts that are displayed in the respective 
panel. In all panels, mutation rates were normalized to the number of 
nucleotides at risk in a 1 Mb window before determining the slopes. See also 
Extended Data Fig. 3. 


flat the overall regional rates are with respect to replication timing in 
each sample (Fig. 3d, R² = 0.37, P = 0.0017). Importantly, this is also 
true for mutations in contexts that increase little or decrease in relative 
frequency in MSI genomes: all transversions (excluding C > A in CCN) 
and the transitions A > G in AAN and C > T in TCN (Fig. 3e and 
Extended Data Fig. 3e, f; R² = 0.38-0.45, P < 0.0015). Thus, the greater 
the proportion of its history a tumour has spent in an MMR-deficient 
state, the flatter its distribution of mutations across the genome, and this 
flattening is observed for mutations in both MSI-associated and non- 
MSI-associated contexts. 

To quantify more precisely the contribution of MMR to the observed 
regional mutation variation across the genome, we next estimated the 
time spent in the MMR-proficient and MMR-deficient states in each 
individual tumour. We used a simple model that assumes that genome- 
wide mutation rates for each mutational context are equal across samples 
but allows for a distinct rate for each context in the MMR-proficient and 
MMR-deficient states (schematic in Fig. 4a, b). We employed a genetic 
algorithm-based optimization that minimizes the difference to observed 
mutational signatures (Methods). After fitting a global set of rate param- 
eters, this simple model captures 68% of the variance in differential muta- 
tional context usage across the MSI and set of MSS cancer samples from 
all three MSI-prone cancer types (Extended Data Fig. 4). Importantly, 
the deconvolution recapitulated the known consequences of MMR 
deficiency on the mutational spectrum and the MSI status of each sample 
(Fig. 4c, d). 
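
The two-state mixture behind this deconvolution can be made concrete with a small sketch. It is not the authors' method: they fit a global set of rates with a genetic algorithm, whereas the example below assumes the two rate vectors are already known and solves only for one sample's fraction of time in the MMR-proficient state by closed-form least squares. The function name and the example spectra are hypothetical.

```python
def estimate_mss_fraction(observed, rate_mss, rate_msi):
    """Least-squares estimate of z in
        observed ~ z * rate_mss + (1 - z) * rate_msi,
    clipped to the interval [0, 1]. All inputs are per-context
    relative mutation frequencies in the same context order."""
    diff = [a - b for a, b in zip(rate_mss, rate_msi)]
    denom = sum(d * d for d in diff)
    if denom == 0:
        raise ValueError("identical spectra: z is not identifiable")
    z = sum((o - b) * d for o, b, d in zip(observed, rate_msi, diff)) / denom
    return min(1.0, max(0.0, z))


# Hypothetical three-context spectra for the two states.
rate_mss = [0.5, 0.3, 0.2]
rate_msi = [0.2, 0.3, 0.5]
# A sample that spent a quarter of its mutational history MMR-proficient.
observed = [0.25 * a + 0.75 * b for a, b in zip(rate_mss, rate_msi)]
z = estimate_mss_fraction(observed, rate_mss, rate_msi)
```

Here `z` recovers the MMR-proficient share and `1 - z` the MMR-deficient share; a joint fit of the rates across many samples, as in the text, needs an iterative optimizer instead.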

The model shows that the earlier MMR fails in the history of a tumour, 
the flatter its regional rate landscape is with respect to replication timing 
(Fig. 4e, R² = 0.54, P < 10⁻⁴; examples in Fig. 4f). Indeed, the inferred 
time of MMR failure predicts the loss of regional rate variability better 
than a simple proportion of mutations in MSI-prone contexts (R² = 0.37, 
Figure 4 | Inferring the time of MMR failure by a deconvolution of the 
mutational signatures. a, b, Examples illustrating the parameters estimated: 
relative mutation rates in the MSS (a) or MSI states (b), different for mutational 
contexts but constant across samples, and the relative time spent in MSS (z) 
or MSI (1 — z), which can vary across samples. TCGA-AA-3516 is a tumour 
sample identifier. c, Estimated rates for mutational contexts. Significance given 
for increase of b over a in the eight MSI contexts (Wilcoxon test). Contexts with 
different 3’ flanking nucleotides were pooled, except the C > T in NCG 
contexts; other C > T changes are labelled NCH. Ts, transitions; Tv, 
transversions. d, The estimated proportion of MSS mutations for the MSI 
versus a set of MSS samples. P value by Mann-Whitney test, two-tailed. 

e, Estimated number of mutations arising after MMR failure correlates to the 
loss of variability in regional mutation rates (slope of the relative rates across 
replication timing bins, see Figs 2 and 3) across the MSI samples. The 99% 
confidence interval (CI) of the fitted line crosses zero at <100% mutations 
post-MSI, indicating (P< 0.01) that the mutation rate landscape in 
MMR-deficient cells does not show replication-timing-associated regional 
variability. f, Relative mutation rates of chromosome 13 for the median of five 
samples with the largest per cent of post-MMR failure mutations (e, rightmost 
points) versus five MSI samples with the least per cent post-MMR failure 
mutations (e, leftmost points). Both groups consist of two CRAD, two UCEC 
and one STAD. 


Fig. 3d). In addition, the 99% confidence interval of a linear fit to the 
points in Fig. 4e crosses the horizontal zero line—corresponding to a 
fully flat regional landscape—before the point at which all mutations 
are predicted to have arrived in the MMR-deficient state. This indi- 
cates that the mutations that arose after the inactivation of MMR in 
these tumours are not distributed with the characteristic regional vari- 
ation across the genome. In the absence of MMR, mutation rates are 
not reduced in early replicating euchromatic regions compared to in 
late replicating heterochromatin. 

Through an analysis of human tumours we have shown that MMR 
is more effective in euchromatic early replicating regions of the genome, 
suppressing the accumulation of mutations in these regions. MMR is 
known to be coupled to DNA replication, with elevated repair efficiency 
during S phase. Differences in DNA accessibility to the repair 
machinery, the coupling of this machinery to the replication fork, 
or the time available for repair might contribute to the increased efficiency 
of repair in early replicating euchromatin. Across cell types, most 
active genes performing essential functions are euchromatic and replicated 
early. It is thus sensible to envisage that enhanced MMR in 
euchromatin is a beneficial trait and one that has been selected for 
during evolution. 
METHODS 


TCGA genomes and calling somatic mutations. We downloaded aligned short 
reads (to hg19/GRCh37) for the available whole-genome sequences of tumours 
(n = 630) and the matched normal tissue from The Cancer Genome Atlas (TCGA) 
repository at CGHub. We then called somatic SNVs in each tumour-normal pair 
using Illumina’s Strelka 1.0.6 workflow. Strelka is a highly accurate caller, with a 
low false positive rate of SNVs at the default settings. We further increased the 
stringency of Strelka’s post-call filtering to prevent spurious mutation calls. By default, 
Strelka requires that the overall confidence score (QSS_NT) is ≥15, that the fraction 
of filtered basecalls at the site (BCNoise) is <40%, and also that the fraction of 
reads crossing the site with spanning deletions (SpanDel) is <75%. Here, we allow very 
few filtered or gapped reads at the site: BCNoise and SpanDel must both be <3% 
for the tumour sample and <10% for the normal tissue. Exceptionally, for extremely 
high-confidence calls (QSS_NT ≥45), BCNoise for the tumour sample may 
be <6%. For TCGA leukaemia (LAML), we downloaded the called somatic SNVs 
from the corresponding publication (n = 50 samples). 
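
The stricter post-call filter described above can be written as a single predicate. This is a hedged sketch rather than Strelka's own interface: the function and argument names are hypothetical, fractions are given on a 0 to 1 scale, and the QSS_NT thresholds are read as lower bounds (at least 15, at least 45), which is an interpretation of the text.

```python
def passes_strict_filter(qss_nt, bcnoise_tumour, spandel_tumour,
                         bcnoise_normal, spandel_normal):
    """Return True if a somatic SNV call survives the tightened filter.

    qss_nt          -- overall confidence score of the call
    bcnoise_*       -- fraction of filtered basecalls at the site
    spandel_*       -- fraction of reads with spanning deletions
    """
    if qss_nt < 15:  # default confidence requirement
        return False
    if bcnoise_normal >= 0.10 or spandel_normal >= 0.10:
        return False
    if spandel_tumour >= 0.03:
        return False
    # Very high-confidence calls may tolerate slightly more tumour BCNoise.
    bcnoise_limit = 0.06 if qss_nt >= 45 else 0.03
    return bcnoise_tumour < bcnoise_limit
```

For example, a call with tumour BCNoise of 4% fails at QSS_NT 20 but passes at QSS_NT 50, provided the other fractions are within their limits.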

Other whole-genome sequences. We downloaded the previously called somatic 
SNVs for whole-genome sequences (n = 507) from the online supplementary material 
of Alexandrov et al.; these samples did not overlap the TCGA data set. Next, 
we downloaded the somatic SNVs from the whole-genome sequences in the International 
Cancer Genome Consortium (ICGC) v.15.1 database, where the same 
genome sets were not already available in Alexandrov et al. or TCGA; this encompasses 
the ICGC projects RECA-EU, MALY-DE and EOPC-DE (n = 150 genomes). 
Finally, we removed all samples of B-cell lymphoma from the Alexandrov 
et al. set due to a suspected partial overlap with a broader set in ICGC MALY-DE. 
Filtering and dividing the genome into 1 Mb windows. To rule out errors due to 
misalignment of short reads, we masked out all regions in the genome defined in 
the ‘CRG Alignability 36’ track, requiring a 36-nucleotide fragment to be unique 
in the genome even after allowing for two differing nucleotides. Next, we masked 
out the regions in the UCSC Browser blacklists (Duke and DAC), chosen for often 
exhibiting anomalous signals in next-generation sequencing experiments (http:// 
genome.ucsc.edu/cgi-bin/hgTrackUi?g=wgEncodeMapability). Finally, we discarded 
the exons (plus the flanking 2 nucleotides) of the UCSC gene set to avoid the signal 
stemming from selection on gene coding regions. Then we divided the genome 
into non-overlapping 1 Mb windows and discarded those with <250 kb of DNA 
passing the above filters; we also excluded the remaining windows on chromosome 
Y (n= 7) as Repli-Seq data were not available for them (see later). This yielded 
2,748 1 Mb windows with an average of 664 kb highly alignable, non-blacklisted, 
non-exonic DNA per window. When determining SNV densities, the SNV counts 
in each window were divided by the effective length of the window (after masking) 
to obtain frequencies per Mb. The choice of 1 Mb resolution is a trade-off between 
the inclusion of low mutation burden tumour samples/types (better with coarser 
resolution) and the level of detail in describing the regional variability (better with 
finer resolution). Furthermore, replication timing is known to be organized in 
megabase-scale domains. 
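
The windowing and density calculation above can be sketched in a few lines. This is a minimal illustration with hypothetical names; it assumes 0-based SNV positions on a single chromosome and that the usable (alignable, non-blacklisted, non-exonic) length of each window has already been computed from the masks.

```python
from collections import Counter

WINDOW_BP = 1_000_000      # non-overlapping 1 Mb windows
MIN_USABLE_BP = 250_000    # windows with less usable DNA are discarded


def count_snvs_per_window(positions):
    """Count SNVs per window index for 0-based positions on one chromosome."""
    return Counter(pos // WINDOW_BP for pos in positions)


def snv_density_per_mb(snv_count, usable_bp):
    """SNVs per Mb of usable DNA, or None if the window is discarded."""
    if usable_bp < MIN_USABLE_BP:
        return None
    return snv_count * 1_000_000 / usable_bp


# Hypothetical SNV positions: two in window 0, one in window 2.
counts = count_snvs_per_window([10, 999_999, 2_500_000])
density = snv_density_per_mb(counts[0], usable_bp=664_000)
```

Dividing by the usable length rather than the nominal 1 Mb keeps heavily masked windows comparable with lightly masked ones.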

Conditions for inclusion in the final genome set. We merged the three sets of 
genomes (TCGA, Alexandrov et al. and ICGC) and discarded the genomes that 
had fewer than 3,000 SNVs in the alignable regions of the genome in order to obtain 
more reliable regional mutation density estimates. This filter completely removes 
the cancer types with very low SNV loads, such as TCGA thyroid, prostate, 
leukaemia or kidney chromophobe. We also removed four cancer types represented 
by only a single genome with ≥3,000 SNVs (lower grade glioma (LGG) in TCGA, 
acute lymphoblastic leukaemia (ALL) and medulloblastoma in Alexandrov et al., 
and prostate cancer (EOPC-DE) in ICGC). Three TCGA uterine cancer (UCEC) 
genomes (TCGA-AP-AOLO, B5-A1MY, EY-A1GS) exhibited a high number of 
C > A changes at low clonal frequencies, consistent with known artefacts arising 
owing to 8-oxoG in oxidized DNA samples, and were thus excluded, arriving at 
a final set of 652 genomes and 16,962,503 SNVs in the alignable, non-blacklisted, 
non-exonic regions of said genomes (data sources listed in Supplementary Table 1). 
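
The two inclusion rules above (a minimum SNV count per genome, then removal of cancer types left with a single genome) can be written as a short filter. The sketch below is a hypothetical illustration: the record layout and function name are assumptions, and the exclusion of the three artefact-affected UCEC genomes is not modelled.

```python
from collections import Counter


def select_genomes(genomes, min_snvs=3000):
    """Keep genomes with enough SNVs, then drop cancer types with one genome left."""
    kept = [g for g in genomes if g["snvs"] >= min_snvs]
    per_type = Counter(g["cancer_type"] for g in kept)
    return [g for g in kept if per_type[g["cancer_type"]] > 1]


# Hypothetical input records (identifiers and counts are invented).
example = [
    {"id": "g1", "cancer_type": "X", "snvs": 5000},
    {"id": "g2", "cancer_type": "X", "snvs": 4000},
    {"id": "g3", "cancer_type": "Y", "snvs": 9000},
    {"id": "g4", "cancer_type": "Y", "snvs": 100},
]
selected = select_genomes(example)
```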
Microsatellite instability status of samples. The MSI-H, MSI-L or MSS status was 
taken from the clinical data files on the TCGA FTP site repository, named 
nationwidechildrens.org_clinical_patient_xxxx.txt or nationwidechildrens.org_auxiliary_xxxx.txt, 
where xxxx is one of coad, read, stad or ucec, corresponding to colon, rectal, 
stomach or uterine cancer. The methods used for PCR phenotyping are described 
in the corresponding TCGA publications. Given previous reports stating that 
MSI-L samples are much more similar to MSS than to MSI-H in clinical and 
genomic features, we grouped MSI-L with MSS in all analyses. 

For 10 (of 64) colorectal cancers with the whole-genome sequence available, the 
MSI annotations were not supplied in the TCGA and we thus inferred them from 
the overall load of somatic SNVs and indels called by Strelka (see earlier). In 
particular, AD-A5EJ and QG-A5Z2 were putatively labelled as MSI tumours (both 
having >15,000 small indels and >40,000 SNVs) and QG-A5YW, QG-A5YX, 
A6-A56B, QG-A5SYV, A6-A566, QG-A5Z1, A6-A567 and A6-A565 were putative MSS 
tumours (all with <1,500 small indels and <15,000 SNVs). In the case of uterine 
and stomach cancers, all samples with whole-genome sequences were assigned an 
MSI/MSS label by the TCGA. 
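
The indel and SNV loads quoted above separate the two groups cleanly, which a simple rule can illustrate. Note that the text reports these numbers as observed ranges for the ten unannotated samples, not as a formal classifier; the function below is a hypothetical sketch that treats them as thresholds and returns no label for anything in between.

```python
def infer_msi_label(n_indels, n_snvs):
    """Return 'MSI', 'MSS' or None from small-indel and SNV loads."""
    if n_indels > 15_000 and n_snvs > 40_000:
        return "MSI"
    if n_indels < 1_500 and n_snvs < 15_000:
        return "MSS"
    return None  # ambiguous load: left unlabelled in this sketch
```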

MSI and MSS PolE mutants. The PolE status of samples was inferred by requiring 
a non-synonymous somatic mutation called in the POLE gene. Consistent with 
previous reports, the MSI tumours with a PolE mutation have an overall 
mutational load similar to MSI PolE wild-type samples, unlike the PolE-mutated MSS 
samples, which are ultramutated. We thus grouped the MSI PolE mutants with 
MSI PolE wild-type samples, except in the case of two (out of three) MSI PolE mutants 
in uterine cancer (UCEC: AX-A0J1 and AP-A051). These samples had a much higher 
mutational load than other MSI UCEC samples (leftmost UCEC columns in 
Extended Data Fig. 1c). Moreover, they also had a mutational signature not 
consistent with other MSIs: in particular, the relative abundance of A > G transitions 
was low in these samples (11% and 13%), much less than in ten other UCEC MSIs 
(average 34%, range 23-60%). We thus omitted these two MSI + PolE UCEC samples 
from all analyses that involved examining mutational signatures (corresponding 
to Figs 3 and 4, and Extended Data Figs 3 and 4), thus forming the set of 24 MSI 
samples used therein, of which 10 were CRAD, 10 were UCEC and 4 were STAD. 
Cancer exome sequences. The called somatic mutations from exome sequencing 
(MAF files) were downloaded from the TCGA for the COAD/READ (CRAD), UCEC 
and STAD cancer types in October 2014. For each tumour sample (TCGA patient), 
we selected the newest available MAF that had mutation data for that sample, and 
we did not load further mutation data for that sample from other MAFs (files listed 
in Supplementary Table 1). Next, we assigned MSI-H, MSI-L or MSS status to the 
exomes from the same data sources used for the whole genomes (see earlier) and 
discarded samples for which the MSI status was not known. This left 950 samples, 
of which 195 were MSI-H, and the rest were MSS or MSI-L (pooled together). PolE 
status was not inferred for exomes. The genomic mask for the exome analysis was 
constructed differently from that for whole genomes: we similarly used the ‘CRG 
Alignability 36’ filter and the two UCSC blacklists. However, we excluded all DNA except 
the protein coding exons (but not exons of commonly mutated cancer genes, which 
were also excluded). Finally, we retained only the 1 Mb windows with at least 5 kb of 
alignable, non-blacklisted exonic DNA. This reduced the initial set of 2,748 windows 
(for the whole-genome analysis) to 1,709 windows for the exome analysis. Density 
of mutations was expressed per Mb of available DNA in each window, and again 
normalized by dividing by the average of all windows in a sample. For the analysis 
where we considered each exome separately (Extended Data Fig. 2j), we limited the 
analysis to exomes with ≥50 SNVs in the selected genomic windows. 

Gene expression data. The expression levels in tumours were downloaded from 
the TCGA RnaSeqV2 data sets, where they are expressed as transcripts per million 
(TPM) values for each gene. For 15 TCGA cancer types that had RnaSeqV2 data 
available, we downloaded TPM levels for those tumour samples for which we had 
called whole-genome somatic SNVs, in total 8-92 RnaSeqV2 samples per cancer 
type (average = 29). The TPM levels of genes overlapping (including partially) each 
1 Mb window were averaged for a tumour sample, and we then averaged over all 
samples of each cancer type to get the final expression level for that 1 Mb window in 
that cancer type; if lower than 0.01 TPM, it was adjusted to 0.01 TPM. The overall, 
cross-tissue 1 Mb expression levels (in Extended Data Fig. 2a-c) are then the medians 
across 15 cancer types. 
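
The aggregation steps above (mean over genes in a window, mean over samples, floor at 0.01 TPM, median over cancer types) can be sketched as below. This is a minimal illustration with hypothetical function names and invented TPM values, not the pipeline used in the study.

```python
from statistics import mean, median

TPM_FLOOR = 0.01  # expression levels below this are raised to it


def window_expression(samples):
    """samples: one list of gene TPMs per tumour sample, for a single 1 Mb window."""
    per_sample = [mean(gene_tpms) for gene_tpms in samples]
    return max(mean(per_sample), TPM_FLOOR)


def cross_tissue_expression(by_cancer_type):
    """Median of the per-cancer-type window expression levels."""
    return median(window_expression(s) for s in by_cancer_type.values())


# Invented example: one window, three cancer types.
example = {"A": [[1.0]], "B": [[5.0]], "C": [[0.0]]}
overall = cross_tissue_expression(example)
```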

The tissue specificity (TS) of gene expression of 1 Mb windows for a particular 
cancer type (in Fig. 1g) is the log2 ratio of TPM in that cancer type and the average 
TPM across all cancer types. In comparing the TS of gene expression to the TS of 
mutation rates, we limited the analyses to the 2,442/2,748 1 Mb windows that were 
at least somewhat expressed (>0.01 TPM) in at least one examined cancer type. 
Moreover, we excluded chromosome Y for consistent treatment with the 
replication timing TS analysis (see later). Finally, we considered only the eight cancer types 
(seven tissues) with significant shifts in the PC3 of regional mutation rates. 
Replication timing data. We downloaded the Repli-Seq measurements (as 
wavelet-smoothed signal) of ENCODE cell lines from the UCSC Genome Browser (also 
available in the Gene Expression Omnibus under accession number GSE34399). 
To avoid biasing the sample, we excluded multiple lymphoblastoid cell lines and 
retained Gm12878 as a representative. We computed the average Repli-Seq signal 
within 1 Mb genome windows of the remaining 11 cell lines, except chromosome 
Y, which was unavailable in the original data. The resulting values ranged from 
0-100, where higher values indicate earlier replication. The overall, cross-tissue 
replication timing signal (used for genome binning in Figs 2 and 3 and Extended 
Data Figs 2 and 3) is the median value across the 11 cell lines. 

The TS of replication timing in a cell line (in Fig. 1f) is the difference between the 
Repli-Seq signal of that cell line and the average signal across all cell lines. For the 
TS analysis, we considered those cell lines that (1) could be matched to a cancer type, 
based on their tissue of origin, and that (2) corresponded to one of the eight cancer 
types found to have a significant shift in PC3 of the regional mutation rates (Fig. 1e). 
In particular, replication timing TS in MCF-7 cells served as a proxy for BRCA 
(breast cancer), BJ cells for SKCM (melanoma), IMR-90 cells for LUSC/LUAD 
(lung cancers), HepG2 cells for LIHC (liver cancer) and Gm12878 cells for DLBC 
(lymphoma). For consistency with the gene expression TS analyses, we considered 
the same set of 1 Mb windows (see earlier) in the replication timing TS analysis. 
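
The two tissue-specificity measures differ in form: a log2 ratio for expression and a plain difference for replication timing. A minimal sketch for a single 1 Mb window follows; the function names and the example values are hypothetical.

```python
from math import log2
from statistics import mean


def expression_ts(tpm_by_type, cancer_type):
    """log2 ratio of a cancer type's TPM to the average TPM across all cancer types."""
    return log2(tpm_by_type[cancer_type] / mean(tpm_by_type.values()))


def replication_ts(signal_by_line, cell_line):
    """Repli-Seq signal of a cell line minus the average signal across all cell lines."""
    return signal_by_line[cell_line] - mean(signal_by_line.values())
```

Because expression is floored at 0.01 TPM before this step, the ratio is always defined.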
Heterochromatin data. We downloaded the RoadMap epigenomics H3K9me3 
ChIP-seq signal for a diverse set of healthy tissues or cell lines from the Gene 
Expression Omnibus, encompassing: GSM621651 (adult kidney), GSM537710 (adult liver), 
GSM670028 (mesenchymal stem cells), GSM772917 (CD4 naive), GSM669939 
(fetal lung), GSM450266 (H1 cell line), GSM521914 (IMR-90 cell line). In addition, 
we included the H3K9me3 levels from Barski et al. that were previously shown to 
have a strong correlation to regional mutation rates in cancer. We calculated the 
mean H3K9me3 signal in 1 Mb windows for each sample and trimmed the 
distribution at the 99.9th percentile. Then we log-transformed the values and found the 
median over eight tissues/cell lines to get the overall H3K9me3 levels used for 
binning (in Extended Data Fig. 2d-f). 

Slopes over genomic bins. We used the overall, cross-tissue Repli-Seq signal to 
create five equal-frequency (same number of 1 Mb windows) genome bins for further 
analyses. Additional sets of bins were created also for the gene expression and for 
the heterochromatin signal (Extended Data Fig. 2a-f). The slope of a regression 
line fitted through the average 1 Mb relative mutation rates of each bin is a measure 
of the association of mutation rates to replication timing (Figs 2-4). This measure 
has the desirable property of being robust to differences in the overall mutation 
load between smaller groups of tumour samples or individual genomes, or between 
mutational signatures. 
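
The binning-and-slope measure can be sketched as follows. This is an illustrative Python version under stated assumptions: the bins are scored 1 to 5 on the x axis (the text does not specify the x values used), ties in the replication timing signal are broken by input order, and all names are hypothetical.

```python
from statistics import mean


def equal_frequency_bins(values, n_bins=5):
    """Assign each window a bin index 0..n_bins-1 by the rank of its value."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        bins[i] = rank * n_bins // len(values)
    return bins


def bin_slope(timing, rel_rates, n_bins=5):
    """Least-squares slope through the mean relative mutation rate of each bin."""
    bins = equal_frequency_bins(timing, n_bins)
    ys = [mean(r for r, b in zip(rel_rates, bins) if b == k) for k in range(n_bins)]
    xs = list(range(1, n_bins + 1))  # assumed bin scores 1..n_bins
    x_bar, y_bar = mean(xs), mean(ys)
    num = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    den = sum((x - x_bar) ** 2 for x in xs)
    return num / den
```

Since the input rates are already relative to each sample's mean, the slope is comparable between samples with very different mutation loads, which is the robustness property noted above.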

For the analysis in which intergenic and genic regions were examined separately 
in MSI and MSS cancers, we used the UCSC gene set to define these regions. Since 
the gene exons had already been excluded in a genome pre-processing step, the genic 
regions effectively consist only of introns. Only for the purposes of this intergenic 
versus genic analysis, we relaxed the requirement of ≥250 kb (of total alignable 
DNA) per 1 Mb window to ≥100 kb (either intronic or intergenic) alignable DNA 
per 1 Mb. When calculating relative mutation rates, regardless of whether we were 
analysing intergenic or intronic mutations, each sample was always normalized by 
dividing by the mean of the aggregate (intergenic plus intronic DNA) SNV rates 
across all its windows. The five Repli-Seq bins were the same as in the 
whole-genome analysis. 

Statistical analysis. PCA was performed in R 3.1.1 (R Core Team) and in XLStat 
2014.2 (Addinsoft) on the relative 1 Mb mutation rates of each tumour sample, where 
tumour samples were features (columns in data table), and 1 Mb windows were 
examples (rows). To find the relative rates, first the SNV densities per Mb of 
alignable, non-exonic, non-blacklisted DNA (see earlier) were determined for each 1 Mb 
window in every tumour sample. Then, these densities in each sample were 
normalized by dividing by the mean SNV density of all windows in that sample. Thus, 
the relative mutation frequencies >1 correspond to above-average SNV densities 
in a tumour sample, and <1 to below-average densities. The approximate 95% 
confidence intervals of the median (across tumour samples: Fig. 2d and Extended 
Data Fig. 1e, f) for the 1 Mb windows were estimated using the formula 
±1.58 × IQR/sqrt(n_samp), as defined in the R function boxplot.stats and references therein. 
In the PCA, given that the tumour samples were features (columns) and the 1 Mb 
windows were examples (rows), the PCs will be linear combinations of tumour 
samples, and the loadings of the samples on PCs 3 and 4 are shown in Fig. 1e. The 
expected (baseline) per cent variance in each PC stemming from noise in data was 
estimated using the ‘broken stick’ method, found to outperform related approaches. 
The cancer types were tested for shifts in PC3 loadings using a Mann-Whitney 
test (two-tailed), where the loadings of samples in one cancer type were contrasted 
to the loadings of samples in all other types. 
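
Three of the quantities in this paragraph are simple enough to sketch directly: the relative mutation rates, the approximate confidence interval of the median, and the broken-stick baseline. The Python below is an illustration, not the R code used in the study. In particular, it computes the IQR from inclusive quartiles, whereas R's boxplot.stats uses Tukey hinges, so the two can differ slightly for some sample sizes; all function names are hypothetical.

```python
from math import sqrt
from statistics import mean, median, quantiles


def relative_rates(densities):
    """Divide each window's SNV density by the sample's mean density."""
    m = mean(densities)
    return [d / m for d in densities]


def median_ci(values):
    """Approximate 95% CI of the median: median +/- 1.58 * IQR / sqrt(n)."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    half_width = 1.58 * (q3 - q1) / sqrt(len(values))
    med = median(values)
    return med - half_width, med + half_width


def broken_stick(n_components):
    """Expected fraction of variance per PC under the broken-stick null model."""
    p = n_components
    return [sum(1 / k for k in range(i, p + 1)) / p for i in range(1, p + 1)]
```

A PC whose observed share of variance exceeds the corresponding broken-stick value is taken to carry more than noise.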

Mutational signatures. As in previous work, the mutational signatures are defined 
as relative frequencies of SNVs at different nucleotides in all possible 5’ and 3’ 
nucleotide contexts. The mutations are counted strand symmetrically; thus, six 
possible changes exist: C > G, C > A, C > T, A > T, A > C and A > G. These are 
equivalent, respectively, to: G > C, G > T, G > A, T > A, T > G and T > C. Each 
of the six changes has four possible 5’ and four 3’ neighbouring nucleotides, which 
amounts to 6 × 4 × 4 = 96 contexts. Extended Data Fig. 3a shows their relative 
usage, as percentage of SNVs observed in each context. The MSI propensities of 
contexts (in Fig. 3 and Extended Data Fig. 3) were defined as the log2 ratio of the 
absolute mutation frequency (per Mb) of a context in the MSI samples to its 
mutation frequency in the MSS samples. 
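
The count of 96 contexts and the definition of MSI propensity can be checked with a few lines of Python. The label format (for example `A[C>T]G`) and the function names are choices made for this sketch, not notation from the study.

```python
from itertools import product
from math import log2

BASES = "ACGT"
# The six strand-symmetric changes, written from the C or A reference base.
CHANGES = ["C>G", "C>A", "C>T", "A>T", "A>C", "A>G"]


def contexts():
    """Enumerate all (5' base, change, 3' base) mutation contexts."""
    return [f"{five}[{change}]{three}"
            for change, five, three in product(CHANGES, BASES, BASES)]


def msi_propensity(freq_msi, freq_mss):
    """log2 ratio of a context's per-Mb mutation frequency in MSI versus MSS samples."""
    return log2(freq_msi / freq_mss)
```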

DNA words do not necessarily occur equally frequently in genomes; 
for instance, the CpG dinucleotide is rare. Moreover, they may vary in frequency 
across the genome, as is most evident in the global G + C content variation. The 
outcome of analyses that compare how mutational context usage co-varies with 
replication timing (Figs 3 and 4 and Extended Data Figs 3 and 4) may be affected by 
this. Therefore, we normalized the mutation frequencies in different contexts by 
dividing each by the number of corresponding nucleotides at risk in each 1 Mb 
window (only nucleotides passing the alignability mask described earlier). On data 
thus normalized, we determined the strength of association to replication timing 
via the slope of the line fitted to Repli-Seq bins, as described earlier. (Of note, as 
with the full set of mutations, here also all 1 Mb window values were divided by the 
genome average before binning.) In case different contexts needed to be combined, 
the Repli-Seq slopes were determined for each context separately and then the 
slopes were averaged for the combined context. 
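
A minimal sketch of this normalization for one mutation context follows. It assumes that the "genome average" is the mean of the per-window rates, which the text does not state explicitly, and that every window has at least one nucleotide at risk; the function names are hypothetical.

```python
from statistics import mean


def context_rates(mut_counts, sites_at_risk):
    """Per-window mutations per nucleotide at risk, relative to the mean over windows."""
    raw = [m / s for m, s in zip(mut_counts, sites_at_risk)]
    avg = mean(raw)  # assumed definition of the genome average
    return [r / avg for r in raw]


def combined_slope(slopes):
    """Slope for a combined context: the mean of the per-context slopes."""
    return mean(slopes)
```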

Determining time of MMR failure in samples. We aimed to determine the relative 
time spent in the MMR-proficient and MMR-deficient states for each of the 24 
MSI samples: 10 CRAD, 4 STAD and 10 UCEC (see earlier) by deconvoluting their 
mutational signatures into MSI and MSS components. To these 24 MSI samples, 
we added a further 24 MSS samples, matched by cancer type, while excluding the 
PolE ultramutators (in CRAD/UCEC) or otherwise hypermutated (in STAD) samples. 

To reduce the number of parameters that needed to be estimated, we pooled the 
contexts with different 3’ nucleotides together (this corresponds to how bars are 
drawn in Extended Data Fig. 3a). An exception was the NCG > NTG changes 
(deamination of CpG dinucleotides), which were kept separate from other NCN > NTN 
changes, which are here denoted as NCH (H = A, C or T). This yields a total of 28 
contexts, for each of which we estimated a pre-MMR failure relative mutation rate 
(a) and a post-MMR failure relative mutation rate (b). The rates were assumed to 
be constant across samples and in time. Each sample may, however, spend a 
different fraction of time in the pre-MMR-failure state (z). Thus the total number of 
parameters to estimate is 28 (a for contexts) + 28 (b for contexts) + 24 (z for MSI 
samples) + 24 (z for MSS samples) = 104. 

For a set of 104 parameters, the expected relative frequency of a context (ctx) in a 
sample (samp) may be calculated as (a_ctx × z_samp) + b_ctx × (1 − z_samp). A measure 
of goodness of fit for a candidate set of parameters was the negative root mean squared 
difference between these expected relative frequencies and the observed relative 
frequencies of use of 28 different contexts across the 48 samples. We used a genetic 
algorithm-based optimization to maximize the fit to the observed mutational spectra, 
as implemented in the rgenoud 5.7-12 package in R. The parameters were at defaults, 
except max.generations = 500, gradient.check = FALSE, wait.generations = 100 
and BFGS = FALSE. The starting populations of solutions were generated with 
rnorm(1, 0.2, 0.2) for the rates a, rnorm(1, 0.5, 0.3) for the rates b and rnorm(1, 0.5, 
0.3) for the times z. 
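
The model and its goodness-of-fit score can be written compactly. The Python sketch below illustrates only the objective function; it does not reproduce the rgenoud genetic algorithm, and the function and variable names are hypothetical.

```python
from math import sqrt

N_CONTEXTS, N_MSI, N_MSS = 28, 24, 24
# a and b per context, plus one z per sample.
N_PARAMS = 2 * N_CONTEXTS + N_MSI + N_MSS


def expected_freq(a_ctx, b_ctx, z_samp):
    """Expected relative frequency of a context in a sample."""
    return a_ctx * z_samp + b_ctx * (1 - z_samp)


def fitness(a, b, z, observed):
    """Negative root mean squared difference; observed[s][c] is sample s, context c."""
    sq = [(expected_freq(a[c], b[c], z[s]) - observed[s][c]) ** 2
          for s in range(len(z)) for c in range(len(a))]
    return -sqrt(sum(sq) / len(sq))
```

Swapping a with b and replacing every z by 1 − z leaves `expected_freq` unchanged, so any solution has a mirror image with an identical score.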

The optimization was run 100 times with different random seeds for function 
genoud. The algorithm always converged to one of two clusters of very similar 
solutions that had very similar goodness-of-fit scores and predicted nearly identical 
mutational spectra for the 48 tumour samples (Extended Data Fig. 4b-d). Upon 
examining the coefficients of the solutions and schematic diagrams of the 
accumulation of mutations in time (Extended Data Fig. 4e-j), it becomes evident that the 
two sets of solutions are equivalent, with symmetrical a and b mutation rates, and 
also the pre-MMR failure time (z) and the post-MMR failure time (1 − z) across all 
samples. As the representative solution used in further analysis, we take the median 
a, b and z coefficients across solutions of the cluster in which the MSI-specific 
mutational signature GCH > GTH is predicted to increase in rate after MMR failure. 
In no other way was the algorithm aware of which signatures are specific to MSI 
tumours, or which of the 48 samples were MSI and which were MSS. The algorithm 
could, in principle, be used to deconvolute other mutational processes that have a 
distinct time of (de)activation during carcinogenesis. Its practical use would, 
however, probably be limited to those processes that, like MSI, have a very distinct 
mutational signature. 
Extended Data Figure 1 | Overall mutational burden and megabase-scale 
regional rate variability in tumour samples of MSI-prone cancer types. 
a, b, Correlations of tissue specificity (TS; see Methods) in regional mutation 
rates of diffuse large B-cell lymphoma (DLBC) with TS of gene expression in 
DLBC (a), or with TS of replication timing in the Gm12878 lymphoblastoid 
cell line (b). c, Overall mutational load, as SNVs per Mb of alignable genomic 
DNA (Methods) for MSI-H, MSS (includes MSI-L), PolE mutant tumours, or 
otherwise hypermutated tumour samples. d, PC plot with PCs 3 and 4, as in 
Fig. 1e, but showing only tumour samples for colorectal (CRAD), uterine 
(UCEC) and stomach (STAD) cancers for visual emphasis. e, f, Relative SNV 
frequencies across 1 Mb windows of chromosome 1p in UCEC and STAD. 
Unbroken and dotted lines are the median across tumour samples and its 95% 
confidence interval, respectively. For each tumour sample, relative mutation 
frequencies are always obtained by dividing by the mean of all 1 Mb windows. 
MSI/PolE samples are in the MSI-H group; hyper/ultramutators are not in the 
MSS group. *FDR = 10% for rates significantly closer to unity in MSI-H 
samples (Mann-Whitney test; not applicable to STAD because of too few 
MSI-H samples). 
Extended Data Figure 2 | Reduced correlation of regional mutation rates to 
gene expression, heterochromatin and replication timing in genomes 
and exomes of MSI tumours. a-c, The 1 Mb windows in the genome were 
pooled into five equal-frequency bins by the average gene expression levels 
(log2 transcripts per million (TPM)) in each window. The median and 
interquartile range of relative mutation rates across 1 Mb windows is shown for 
each bin. R² values were always determined on original (not binned) data. 
*P < 0.01 for difference of R after Fisher Z-transform. Gene expression levels 
are medians over TPM across 15 cancer types. Relative SNV frequencies of 
each tumour sample were obtained by normalizing by the average SNV density 
of all genomic 1 Mb windows of that sample. Prior to binning the windows, 
cancer samples in a group were combined by taking the median of the relative 
mutation frequencies for each 1 Mb window, as illustrated for CRAD in Fig. 2d. 
PolE/MSI samples are in the MSI group; ultramutators are not in the MSS 
group. MSI-L samples are pooled with MSS. d-f, Same as in a-c but for five 
heterochromatin bins (median H3K9me3 signal over eight tissues and cell 
lines). g-i, Regional mutation rates in exome sequences of a broader set of 195 
MSI-H tumour samples. The 1,709 genomic 1 Mb windows with at least 5 kb 
alignable protein-coding DNA each were grouped into five equal-frequency 
bins by the median Repli-Seq signal over 11 cell lines (Methods). Mutations 
were pooled across all samples in one cancer type with a known MSI-H or MSS 
status (Methods). a is the slope of the regression line fit to binned data. j, Slopes 
a determined for individual cancer exomes with a sufficient number of 
mutations (≥50 SNVs). Number of samples n shown below each group. For 
all cancer types, MSI-H samples have significantly less negative slopes than 
MSS (P < 0.01, Mann-Whitney test, one tailed). MSI-H also includes the 
MSI-H/PolE mutant samples, and MSS includes the MSI-L samples. In the 
exome analyses, ultramutators were not considered separately. 
microsatellite instability and to replication timing. a, Relative frequencies 
of the 96 mutation contexts (strand symmetric) in MSI versus MSS cancers; 
the MSS group includes MSI-L samples but not MSS/PolE ultramutators. 
Mutations were pooled across samples of MSI-prone tissues (CRAD, UCEC 
and STAD). b, c, Similar to Fig. 3a, b, showing two additional examples of 
mutational contexts with different MSI propensities and their relative mutation 
rates across five genomic replication timing bins. d, Lack of correlation between 
the MSI propensity of a mutational context with its replication timing slope 
[...] samples). Ts, transition; Tv, transversion. e, f, Association of per cent 
MSI-specific signatures (CCN > CAN + GCN > GTN + [C/T]AN > [C/T]GN) 
across cancer samples and the binned replication timing slopes for two 
non-MSI transition signatures in the same samples. Slopes averaged over contexts 
are displayed in each plot. In all panels except a, mutation rates were 
normalized to number of nucleotides at risk in a 1 Mb window before 
determining the replication timing slopes. 
Extended Data Figure 4 | The deconvolution of MSI mutational spectra 
robustly converges onto two equivalent solutions. a, Agreement of the 
observed relative frequencies of mutational contexts in each tumour sample 
with the predictions of model 1 (having median a, b and z coefficients across all 
solutions in cluster 1). b, Sets of best-fit solutions determined in a hundred 
optimization runs initialized with different starting conditions. The solutions 
cluster into two homogeneous clusters (Pearson R > 0.9 between >90% of 
the solutions within a cluster, in UPGMA hierarchical clustering). 
c, d, Solutions within both clusters have similar fit to observed data (c) and 
make extremely similar predictions for mutation spectra in tumour 
samples (d). e-h, Similar to Fig. 4a, b. Example mutation accumulation 
diagrams for two mutation contexts typical of MSI tumours, shown for an 
example MSI tumour TCGA-BR-4280 (e, g) and for an MSS tumour 
TCGA-CD-8529 (f, h). i, j, Values of the parameters in two solution clusters, with 
medians and interquartile ranges (shown as whiskers). Each solution 
encompasses 104 parameters: relative mutation rates a and b for each of 28 
mutational contexts (i), and the relative pre-MMR failure time z for each 
tumour sample of the 24 MSI and 24 MSS samples (j). 
A direct GABAergic output from the basal ganglia to frontal cortex 

Arpiar Saunders, Ian A. Oldenburg, Vladimir K. Berezovskii, Caroline A. Johnson, Nathan D. Kingery, Hunter L. Elliott, 
Tiao Xie, Charles R. Gerfen & Bernardo L. Sabatini 


The basal ganglia are phylogenetically conserved subcortical nuclei 
necessary for coordinated motor action and reward learning. Current 
models postulate that the basal ganglia modulate cerebral cortex 
indirectly via an inhibitory output to thalamus, bidirectionally 
controlled by direct- and indirect-pathway striatal projection neurons 
(dSPNs and iSPNs, respectively). The basal ganglia thalamic 
output sculpts cortical activity by interacting with signals from 
sensory and motor systems. Here we describe a direct projection from 
the globus pallidus externus (GP), a central nucleus of the basal 
ganglia, to frontal regions of the cerebral cortex (FC). Two cell types 
make up the GP-FC projection, distinguished by their 
electrophysiological properties, cortical projections and expression of choline 
acetyltransferase (ChAT), a synthetic enzyme for the neurotransmitter 
acetylcholine (ACh). Despite these differences, ChAT+ cells, which 
have been historically identified as an extension of the nucleus basalis, 
as well as ChAT− cells, release the inhibitory neurotransmitter GABA 
(γ-aminobutyric acid) and are inhibited by iSPNs and dSPNs of dorsal 
striatum. Thus, GP-FC cells comprise a direct GABAergic/cholinergic 
projection under the control of striatum that activates frontal cortex 
in vivo. Furthermore, iSPN inhibition of GP-FC cells is sensitive to 
dopamine 2 receptor signalling, revealing a pathway by which drugs 
that target dopamine receptors for the treatment of neuropsychiatric 
disorders can act in the basal ganglia to modulate frontal cortices. 

iSPNs are the major dopamine 2 receptor (D2R)-expressing cells in 
the brain and project from dorsal striatum to GP, suggesting that the 
therapeutic effects of drugs that target D2Rs to treat schizophrenia, 
bipolar disorder and obsessive compulsive disorder may involve GP 
circuits. GP neurons are generally described as GABAergic, spontaneously 
active, and projecting to the thalamus and all nuclei of the basal ganglia. 
Thus the GP is thought to coordinate subcortical activity through 
inhibition. Nevertheless, there are ChAT+ neurons in and around GP that 
project to cortex and appear to be innervated by striatal projection 
neurons from dorsal striatum, despite the rarity of iSPN synapses 
at the ultra-structural level. In macaques, GP neurons with nucleus 
basalis-like firing properties respond to reward, a computation 
attributed to the basal ganglia. Furthermore, humans with GP lesions 
exhibit reduced metabolism in frontal cortices and psychiatric symptoms 
reminiscent of patients with frontotemporal lobe damage, consistent 
with loss of substantial extrinsic input. Therefore, we examined whether the 
GP contains a projection system to FC that is functionally integrated 
into basal ganglia circuitry. 

Retrograde labelling with fluorescent microspheres (retrobeads) in 
mice expressing eGFP under the control of the dopamine 2 receptor 
locus (Drd2-eGFP) identified ipsilateral FC-projecting neurons within 
GP and clustered on the GP-nucleus basalis and GP-striatum borders 
(Fig. 1a and Extended Data Fig. 1a-c). Nearly all GP-FC projecting 
neurons express the GABA vesicular transporter (Vgat, also known 
as Slc32a1) and synthetic enzyme GAD65 (also known as Gad2), while 
a subset (72%) also express ChAT (Extended Data Fig. 1d, e). These 
Figure 1 | The GP and bordering nucleus basalis contain two cell types that 
project to FC. a, Left, sagittal section from a Drd2-eGFP mouse injected with 
retrobeads into FC. Center, retrobead+ neurons (inset and highlighted with 
circles) in a medial section of GP. Right, retrobead+ overlay from three separate 
injections, spanning ~300 μm and excluding ventral basal forebrain. ac, 
anterior commissure; ic, internal capsule; Str, striatum; VP, ventral pallidum; 
NB, nucleus basalis. b-d, Anterograde labelling of ChAT+ and ChAT− GP-FC 
axons. b, Coronal sections from a ChAT(i-Cre) mouse injected in GP (Extended 
Data Fig. 3g) with rAAVs DIO-eGFP (Cre-On) and FAS-tdTomato (Cre-Off) 
sampled from a whole-brain reconstruction. M1, primary motor cortex; Rt, 
thalamic reticular nucleus; SNr, substantia nigra reticulata. c, 
Anterior-posterior distribution of normalized ChAT+ and ChAT− cortical axon 
densities (two mice, solid and dotted lines). d, Left, GP-FC axons 
across layers in anterior M1. Right, normalized average fluorescence from 
dotted box. wm, white matter. e, ChAT+ and ChAT− GP-FC cells are 
distinguishable in acute brain slices following green retrobead injection in 
FC of ChAT(i-Cre);Rosa26(lsl-tdTomato) mice. f, Example membrane voltage (Vm) 
traces for GP-FC cells following positive (ChAT+, 1.7; ChAT−, 0.9 nA) and 
negative (ChAT+, −0.2; ChAT−, −0.1 nA) current injections (500 ms) to 
determine maximum firing rates and hyperpolarized membrane properties. 
Resting Vm is indicated. g, Whole-cell spontaneous firing rates (n = 45 ChAT+ 
cells, n = 35 ChAT−, 10 mice). *P < 0.05 (Mann-Whitney test). 
results indicate that GP-FC projection neurons may be GABAergic but 
can be subdivided based on cholinergic marker expression (hereafter 
ChAT+ and ChAT− cells). ChAT+ cells of the ventral nucleus basalis 
also express GABAergic markers, suggesting that much of the basal 
forebrain cholinergic system may co-release GABA (Extended Data 
Fig. 1f, g). ChAT− GP-FC cells do not express parvalbumin (Extended 
Data Fig. 1h-j), thus they are distinct from GP neurons that project to 
posterior basal ganglia nuclei and other non-cholinergic cortically 
projecting cells in the basal forebrain. Analysis in a rhesus macaque 
confirmed that similar classes of neurons are conserved in primates: 
retrograde labelling followed by ChAT immunostaining identified ChAT− 
and ChAT+ cortically projecting neurons in GP (Extended Data Fig. 2a-e). 
In both macaque and mouse, ChAT+ cells were distributed around the 
GP and GP-nucleus basalis border, areas that in the mouse are exposed 
to iSPN axons (Extended Data Fig. 2f, g). 

The projection patterns of mouse ChAT+ and ChAT− GP-FC cells 
were determined by selectively expressing fluorophores in each cell class 
and analysing 3D brain reconstructions (two mice, Extended Data Fig. 3 
and Supplementary Video 1). ChAT+ and ChAT− GP cells target 
anterior cortices, yet arborize in different but overlapping cortical layers 
and subcortical nuclei (Fig. 1b-d and Extended Data Fig. 4). In most 
cortical regions, ChAT+ axons arborize in layers 1-6, most densely in 
layers 1-3, whereas ChAT− axons arborize densely in layers 5 and 6 
and are absent from layer 1. In ectorhinal cortex, ChAT− axons extend 
into layer 1 (Extended Data Fig. 3k). 

ChAT+ and ChAT− GP-FC cells are electrophysiologically distinct 
(Fig. 1e-g and Extended Data Fig. 5a-f): retrobead+ ChAT− cells exhibit 
hyperpolarization-activated cation currents (Ih) and, compared to ChAT+ 
cells, have smaller somata (1,740 ± 391 versus 5,139 ± 547 μm²), 
narrower action potentials, higher maximum firing rates (~200 versus 30 Hz) 
and less spike accommodation. ChAT− cells were spontaneously active 
at all ages studied, whereas ChAT+ cells become spontaneously active 
around sexual maturity (Extended Data Fig. 5g) and, once active, have 
lower firing rates (4.4 ± 0.39 versus 13.3 ± 1.6 Hz). Thus, ChAT 
expression subdivides these putative GABAergic GP-FC neurons into 
physiologically distinct cell types that may differentially affect cortical and 
subcortical activity during development and in adulthood. 

The influence of this projection on FC was assayed in vivo using extracellular
recordings in awake, head-fixed mice while optogenetically
manipulating GP (Fig. 2 and Extended Data Fig. 6). Mice were habituated
to restraint and periodically pressed a lever for a water reward.
Periods without lever presses were analysed to avoid confounds of motor
behaviour. Pulsed optogenetic activation (473 nm, 5 ms pulses at 10 Hz
for 3 s) of channelrhodopsin (ChR2)-transduced Vgat+ neurons of GP,
but not of mCherry-transduced controls, entrained firing in FC, as shown
by the population activity during the stimulation period and immediately
following individual stimuli (ChR2+, n = 90 units, 5 mice;
mCherry+, n = 99 units, 3 mice) (Fig. 2a and Extended Data Fig. 6a, b).
For each unit, an index of light modulation of activity (I_light) was
calculated: I_light = 0 represents no change in firing, whereas values of +1
or −1 indicate that all firing occurred during light stimulation or
non-stimulation periods, respectively. Optogenetic stimulation decreased
firing rates ~20% overall (I_light = −0.11 ± 0.02, P < 0.0001, Kruskal-Wallis
test, Fig. 2a); however, individual units showed increases (~4%,
n = 4 of 90) or decreases (~42%, n = 38 of 90, P < 0.05, Student's t-test)
in firing rate. No significant light-induced changes were observed above
chance in controls (n = 0 and 2 of 99 units with significantly increased
and decreased activity, three mice) (Extended Data Fig. 6c).
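
The index of light modulation is defined in the Figure 2 legend as the difference between firing rates with light on and light off, divided by their sum. The following is a minimal sketch of that calculation, not the analysis code used in the study; the function names are hypothetical. It also shows why an index of about −0.11 corresponds to a decrease in firing of roughly 20%, assuming the index is applied to a single pair of rates rather than averaged across units.

```python
import math


def light_modulation_index(rate_on, rate_off):
    """Index of light modulation: (f_on - f_off) / (f_on + f_off).

    Returns +1 if all firing occurred with light on, -1 if all firing
    occurred with light off, and 0 if the two rates are equal.
    """
    total = rate_on + rate_off
    if total == 0:
        return math.nan  # undefined for a unit that never fired
    return (rate_on - rate_off) / total


def percent_change_from_index(index):
    """Convert an index back to a percent change in firing rate.

    Rearranging the definition gives f_on / f_off = (1 + I) / (1 - I).
    """
    if index >= 1:
        return math.inf  # no firing at all in the light-off period
    return 100.0 * ((1 + index) / (1 - index) - 1)


if __name__ == "__main__":
    # A unit firing at 8 Hz with light on and 10 Hz with light off.
    print(light_modulation_index(8.0, 10.0))
    # Percent change implied by an index of -0.11.
    print(percent_change_from_index(-0.11))
```

Because the reported values are means of per-unit indices, the percent change recovered from a mean index will only approximate the change in mean population firing rate.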

To establish if ongoing activity in GP basally sculpts FC firing, we 
suppressed GP activity using the light-driven proton pump archaerhodopsin
(ArchT). Constant illumination of ArchT+ GP (594 nm, 3 s) increased
(~26%, n = 25 of 96) or decreased (~20%, n = 19 of 96) firing
of individual FC units (two mice, P < 0.05, Student's t-test, Fig. 2b).
These bidirectional changes resulted in no significant change in population
firing rate (I_light = 0.02 ± 0.01, P > 0.05, Kruskal-Wallis test).
Units in mCherry controls (two mice) were not modulated at a rate 

Figure 2 | GP-FC cells modulate FC firing rates in vivo. a, b, ChR2
stimulation or ArchT inhibition of Vgat^i-Cre GP somata bidirectionally
modulates firing rates in FC. Left, schematic showing extracellular recordings
in FC during optical stimulation of ChAT+ (axons shown in magenta) and
ChAT− (axons shown in black) cells in GP with pulsed 473 nm (5 ms pulses,
10 Hz for 3 s, a) or constant 594 nm (3 s, b) illumination. The mixed ChAT+/
ChAT− GP-FC projection appears striped. Middle left, mean firing rate
(±s.e.m.) of all FC units in response to 473 nm stimulation (a) of ChR2+ (blue;
n = 90 units, 5 mice) or control (black; n = 99 units, 3 mice) mice or 594 nm
stimulation (b) of ArchT+ (yellow; n = 96 units, 3 mice) or control mCherry+
(black; n = 63, 2 mice) mice (ChAT^i-Cre mice expressing mCherry in GP).
Middle right, histogram of indices of light modulation (I_light) of FC unit firing
rates (f), calculated as (f_LightOn − f_LightOff) / (f_LightOn + f_LightOff). Red bars
indicate significantly modulated units (P < 0.05, t-test). ChR2+ stimulation
excited 4 and inhibited 38 units; ArchT+ stimulation excited 25 and inhibited
19 units. Insets, example units, scale bar 5 Hz/1 s. Right, pseudo-coloured
plot of changes in f for each unit, normalized to the baseline period and ordered
by I_light. c, Pulsed (as in a) ChR2 stimulation of Vgat^i-Cre GP axons in FC
increases firing rates on a millisecond time scale. Left, schematic showing
optrode placement in FC used for stimulation and recording. Right, mean
(±s.e.m.) z-scored firing rates after each 5 ms light pulse (blue rectangle)
relative to equivalent baseline period 3 s prior (ChR2+, n = 111, 5 mice;
mCherry+, n = 92, 2 mice). d, Pulsed ChR2 depolarization of ChAT^i-Cre GP
somata increases firing rates in FC. Left, experimental schematic (as in
a). Middle, histogram of I_light for FC units (as in a). ChR2+ excited 15 and
inhibited 2 of 120 units from four mice. Right, pseudo-coloured plot of changes
in firing rate for all units.


greater than chance (n = 1 and 2 of 63 units with significantly increased
or decreased activity, respectively) (Extended Data Fig. 6d). In response
to ArchT-mediated inhibition or ChR2-mediated excitation of GP, some
FC units altered activity within <50 ms of light onset (Extended Data
Fig. 6e). These gain- and loss-of-function experiments indicate that
activity in GP rapidly, potently and bidirectionally modulates neurons
in frontal cortex in vivo. Furthermore, local stimulation of ChR2-expressing
GP axons in FC using an optical fibre/electrode combination
(optrode) revealed that activity of GP-FC neurons is sufficient to
modulate FC. Pulsed illumination of ChR2+ axons did not persistently
change FC activity, but transiently increased firing rates within 5-10 ms
of light onset (n = 111 units, 5 mice), an effect not seen in control
mCherry+ axons (n = 92 units, 2 mice) (Fig. 2c and Extended Data
Fig. 6f, g).
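
The transient response to each light pulse is reported as a z-scored firing rate relative to an equivalent baseline period. The sketch below illustrates one way such a comparison could be set up; it is an assumption-laden illustration rather than the authors' analysis. The function names are hypothetical, and the choice of the sample standard deviation of the baseline bins as the normalizing term is an assumption, since the exact procedure is not given in this section. The pulse timing (10 Hz for 3 s) is taken from the text.

```python
from statistics import mean, stdev


def pulse_onsets(rate_hz=10.0, duration_s=3.0):
    """Onset times (s) of each light pulse in one stimulation train."""
    n_pulses = int(round(rate_hz * duration_s))
    return [i / rate_hz for i in range(n_pulses)]


def zscore_against_baseline(response_bins, baseline_bins):
    """Express binned firing after a pulse in units of baseline s.d.

    response_bins: firing rates (or spike counts) in bins after the pulse.
    baseline_bins: values from the equivalent pre-stimulation period.
    """
    mu = mean(baseline_bins)
    sd = stdev(baseline_bins)  # assumption: sample s.d. of baseline bins
    if sd == 0:
        raise ValueError("baseline has zero variance; z-score undefined")
    return [(value - mu) / sd for value in response_bins]


if __name__ == "__main__":
    onsets = pulse_onsets()
    print(len(onsets), onsets[:3])
    print(zscore_against_baseline([4.0, 2.0], [1.0, 2.0, 3.0]))
```

A 10 Hz train lasting 3 s contains 30 pulses, so each unit contributes 30 peri-pulse windows to the average.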

ChAT+ GP-FC axons project heavily to cortex, but sparsely within
the basal ganglia and thalamus (Extended Data Fig. 4), such that activation
of these cells in GP should minimally engage canonical basal ganglia
outputs. Pulsed excitation in GP increased firing rates in the FC
(n = 120 units, 4 mice) by 11% (I_light = 0.04 ± 0.01, P < 0.0001, Kruskal-Wallis
test, Fig. 2d), bidirectionally modulating activity of a subset of
individual units (increases: ~13%, n = 15 units; decreases: ~2%, n = 2).
Illumination of ChAT+ axons in FC using an optrode confirmed that
these effects were due to activation of GP-FC projection neurons: firing
rates transiently increased 5-10 ms after light onset (n = 74 units, 3 mice)
without persistent changes during the 3 s period of pulsed stimulation
(Extended Data Fig. 6g, h). Lastly, ArchT-mediated suppression of
ChAT+ GP neurons with constant yellow light did not significantly
affect firing rates above chance (~1% increased, n = 1 of 120, four mice;
~3% decreased, n = 3 of 120) (Extended Data Fig. 6i), possibly due to
the low basal firing rate of these cells.

To determine the synaptic mechanisms by which ChAT+ and ChAT−
GP-FC cells modulate cortex, we examined the neurotransmitters released
by each cell type (Fig. 3 and Extended Data Fig. 7a, b). Whole-cell
voltage-clamp recordings in acute brain slices targeted neurons
with somata within ~150 µm of ChR2+ axons (Extended Data Fig. 7c).
Pharmacology and manipulation of the holding potential assayed ChR2-evoked
synaptic currents due to opening of ionotropic GABA, glutamate

Figure 3 | GP-FC cells release GABA and ACh in FC. a, b, DIO (Cre-On) or
DO (Cre-Off) rAAV transduction in the GP of ChAT^i-Cre;Gad1^GFP mice
targets ChR2 to ChAT+ or ChAT− GP neurons. a, Top, IPSCs in a layer 6
interneuron evoked by optogenetic activation of ChAT− axons under baseline
conditions of glutamate receptor antagonism (NBQX and CPP) and following
co-application of (from left to right) voltage-gated Na+ channel blocker
TTX, voltage-gated K+ channel blocker 4-aminopyridine (4AP) and the
GABA_A receptor antagonist SR95531. Bottom, IPSCs in a ChR2− neuron at the
GP-nucleus basalis border following optogenetic activation of surrounding
ChAT+ cells under baseline conditions (ACSF) and following co-application of
glutamate (NBQX & CPP) and nicotinic (MEC, MLA & DHβE) receptor
antagonists, TTX, 4AP and SR95531. b, IPSC peaks across conditions
normalized to baseline (ChAT−, n = 5 cortical interneurons; ChAT+, n = 7
ChR2− GP/nucleus basalis cells). *P < 0.05 (Mann-Whitney test). c-g, Co-localization
analysis of virally labelled ChAT+ presynaptic axon terminals
(PSTs) in FC using array tomography. c, Left, maximum projection
(z = 2.17 µm) through layer 1 of FC showing GFP+ axon following injection of
rAAV DIO-synaptophysin-GFP into the GP of a ChAT^i-Cre mouse. Right, a
single plane of immunohistochemical labelling. d, Consecutive z planes
illustrating synaptic marker association with GFP+ PSTs (arrows in c). PSTs
abut gephyrin but not PSD-95 and contain both VGAT and VAChT punctae
(arrowheads). e, z scores for mean marker densities within GFP+ PSTs for
real versus PST-randomized data (n = 8 stacks), indicating higher densities in
the real data. *P < 0.001 for all stacks. f, Centroid separation of VGAT and
VAChT punctae within the same GFP+ PST (n = 1,851 comparisons). g, Top,
maximum projection of GFP+ axon in layer 1. Bottom, GFP+ PSTs
colour-coded by VGAT/VAChT identity.

and ACh receptors. In a subset of neurons in FC slices (n = 5 of 94
neurons, five mice), optogenetic activation of ChAT− axons evoked
inhibitory postsynaptic currents (IPSCs) whose properties were consistent
with direct GABA release from ChR2-expressing axons and monosynaptic
activation of GABA_A receptors (Fig. 3a, b). Similar analysis in
subcortical slices of ChR2-expressing ChAT+ axons that ramify around
the GP-nucleus basalis border (Extended Data Fig. 7d-f) revealed excitatory
postsynaptic currents (EPSCs) in a small number of cells (n = 2
of 85 cells, six mice) which were unaffected by application of glutamate
receptor antagonists but abolished by nicotinic ACh receptor (nAChR)
antagonists (Extended Data Fig. 7g-l). In addition, larger and more prevalent
IPSCs were detected (n = 7 of 85) that were insensitive to antagonists
of nAChR and glutamate receptors but were abolished by
antagonists of GABA_A receptors (Fig. 3a, b), consistent with direct
release of GABA from ChAT+ GP axons and monosynaptic activation
of GABA_A receptors. We conclude that both ChAT+ and ChAT− GP-FC
cells release GABA, consistent with the expression of markers for
GABA synthesis and handling, and that ChAT+ cells additionally release
ACh.

Recordings in acute brain slices from mice expressing GFP in cortical 
GABAergic interneurons identified the FC neurons monosynaptically 
targeted by each GP-FC cell class (Extended Data Fig. 8a-d). Activation
of ChAT− axons evoked IPSCs in interneurons in layers 2/3, 5 and 6,
as well as pyramidal neurons in layers 5 and 2/3. Activation of ChAT+
axons evoked IPSCs in a small number of interneurons in layers 1, 2/3
and 6 and EPSCs in interneurons in layers 1 and 6. These results suggest
that ChAT− GP-FC cells inhibit interneurons and pyramidal neurons
across cortical layers, whereas ChAT+ GP-FC cells can activate and
inhibit cortical interneurons through the release of ACh and GABA.

Target-specific neurotransmitter release suggests a separation of GABA
and ACh release sites within individual ChAT+ axons in cortex. To examine
this possibility, we labelled ChAT+ GP-FC presynaptic terminals
(PSTs) with synaptophysin-GFP and determined the proximity
of a variety of pre- and postsynaptic proteins in ultra-thin brain slices
by fluorescence immunohistochemistry ('array tomography') and custom
image analysis routines (Fig. 3c-g and Extended Data Fig. 8e-k).
Reconstructed portions of FC (eight stacks, two mice) contained GFP+
volumes that resembled 'pearls on a string'. Detected GFP+ 'pearls' had
volumes consistent with PSTs and co-localized with synapsin I, bassoon
and GAD1/2, supporting their identity as PSTs capable of synthesizing
GABA. GFP+ PSTs co-localized with gephyrin but not PSD-95,
indicating that they oppose inhibitory but not excitatory postsynaptic
densities (Fig. 3e). The majority of GFP+ PSTs immunostained for at
least one of the GABA/ACh vesicular transporters (n = 4,274 of 6,071),
but all combinations were observed: VAChT alone (n = 2,030), VGAT
alone (n = 407) and both VGAT and VAChT (n = 1,837). Within individual
PSTs expressing both vesicular transporters, the VGAT and VAChT
punctae were separable (318 ± 39 nm between centroids), suggesting
that individual axonal boutons can co-release GABA and ACh but do
so through distinct vesicular pools (Fig. 3f, g).
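
The terminal counts reported above can be checked with simple arithmetic. The short sketch below tallies the three labelled categories and expresses each as a fraction of all detected GFP+ terminals; the counts come from the text, while the function and variable names are illustrative only.

```python
# Counts of GFP+ presynaptic terminals by transporter labelling (from the text).
PST_COUNTS = {
    "VAChT alone": 2030,
    "VGAT alone": 407,
    "VGAT and VAChT": 1837,
}
TOTAL_PSTS = 6071  # all detected GFP+ terminals


def summarize_psts(counts, total):
    """Return the labelled total, unlabelled remainder and per-class percentages."""
    labelled = sum(counts.values())
    percentages = {name: 100.0 * n / total for name, n in counts.items()}
    return labelled, total - labelled, percentages


if __name__ == "__main__":
    labelled, unlabelled, percentages = summarize_psts(PST_COUNTS, TOTAL_PSTS)
    print(f"labelled: {labelled} of {TOTAL_PSTS}; unlabelled: {unlabelled}")
    for name, pct in percentages.items():
        print(f"{name}: {pct:.1f}%")
```

The three categories sum to the 4,274 labelled terminals stated in the text, which is about 70% of the 6,071 detected.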

ChAT+ GP-FC projection neurons have been previously identified
as part of the nucleus basalis. To be included in the basal ganglia, GP-FC
cells should receive synaptic inputs typical of GP: glutamatergic inputs
from the subthalamic nucleus (STN) and GABAergic inputs from SPNs
of the dorsal striatum. Indeed, STN axons arborize in the regions containing
GP-FC cells and electrical stimulation of the STN-GP axon tract
evoked glutamatergic EPSCs in ChAT+ and ChAT− FC-projecting cells
(Extended Data Fig. 9a-e). Similarly, axons of iSPNs and, surprisingly,
dSPNs ramified around ChAT+ GP neurons (Fig. 4a and Supplementary
Video 2). Optogenetic stimulation of axons of either striatal cell class
evoked GABAergic IPSCs in nearly all GP-FC cells (iSPNs: ChAT+,
20 of 22; ChAT−, 19 of 20 from 11 mice; dSPNs: ChAT+, 12 of 12;
ChAT−, 11 of 11 from 11 mice; Fig. 4b and Extended Data Fig. 10a, b).
In contrast, ChAT+ cells of the nucleus basalis or substantia innominata
do not receive dorsal SPN input (Extended Data Fig. 10c-e). Together
these results indicate that GP-FC cells are distinct from nucleus

Figure 4 | The GP-FC projection is a basal ganglia output sensitive to
antipsychotic drugs. a, Top left, sagittal section from a Drd2-Cre mouse
injected with rAAVs DIO-eGFP (Cre-On) and FAS-tdTomato (Cre-Off) into
dorsal striatum. Inset 1, iSPN and dSPN innervation zones in relation to
ChAT+ GP cells. Insets 2 and 3, ChAT+ GP-FC clusters. b, Optogenetic
activation of iSPNs (Adora2a-Cre;Rosa26^lsl-ChR2-eYFP/+ mice, top) or dSPNs
(Drd1a-Cre;Rosa26^lsl-ChR2-eYFP/+ mice, bottom) in dorsal striatum evoked
SR95531-sensitive IPSCs in retrobead+ GP-FC ChAT+ and ChAT− cells
following injection in FC. c, iSPN but not dSPN IPSCs are decreased through
D2R activation and reversed by an antipsychotic. Mean IPSC amplitudes
normalized to baseline. Cell numbers are indicated. *P < 0.05 versus ACSF
(Mann-Whitney test); **P < 0.05 quinpirole versus quinpirole + sulpiride
(paired t-test on cells with both conditions, n = 5). Error bars denote s.e.m.


basalis neurons and are functionally integrated into basal ganglia circuitry
through direct, indirect and hyperdirect (STN-GP) pathways.
Although iSPNs and dSPNs both inhibit GP-FC cells, differences in
synaptic strength and short-term plasticity (Extended Data Fig. 10b, f, g)
suggest that dynamic activity in each pathway could differentially inhibit
the GP-FC projection.

The presence of a GABAergic output to cortex under control of striatal
SPNs could be important for understanding the aetiology and treatment
of motor and psychiatric diseases. For example, schizophrenia is
genetically associated with D2Rs, treated with drugs that block D2Rs
and often manifests coincident with developmental changes in cortical
inhibition. Schizophrenics exhibit imbalances in GABA and ACh
systems, as well as molecular and morphological changes in prefrontal
interneurons. Since GP-FC cells are present in primates (Extended
Data Fig. 2), release GABA and ACh onto interneurons (Fig. 3 and Extended
Data Fig. 8a-d) and are inhibited by D2R-expressing iSPNs
(Fig. 4), we examined whether D2R signalling affects iSPN synapses onto
GP-FC cells. Indeed, the D2R agonist quinpirole reduced the amplitude
of iSPN-mediated IPSCs in GP-FC neurons in acute brain slices
(Fig. 4c). This reduction was reversed by the antipsychotic sulpiride and
was not seen in dSPN-mediated IPSCs, confirming that striatal dopamine
could disinhibit the GP-FC projection through antipsychotic-sensitive
D2R signalling (Extended Data Fig. 10h).

The existence of GP-FC cells suggests a major revision to basal ganglia
models. Striatal direct and indirect pathways were proposed to
exert opposite effects on cortical activity through bidirectional control
of ascending thalamic drive. The GP-FC projections bypass thalamus,
allowing dSPNs and iSPNs to modulate cortex in concert. Furthermore,
the hyperdirect cortical projection through the STN excites GP-FC neurons
directly, forming a two-synapse loop for recurrent cortical modulation.
The effects of GP-FC neurons on cortex are complex, mediated
by GABA and ACh acting on diverse postsynaptic targets. The context 
of GP-FC activity during behaviour, the specific identities of cortical 
targets, and the mechanisms and consequences of GABA/ACh co- 
release will be the subjects of further investigation. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 

METHODS


Mice. Bacterial artificial chromosome (BAC) transgenic mice expressing eGFP
under the control of the dopamine 2 receptor locus (Drd2-eGFP) were used to define
the anatomical border of the globus pallidus externus (GP) and ventral pallidum
through the expression of eGFP in striatal iSPNs (GENSAT, founder line S118).
Cre recombinase was targeted to specific cell types of the basal ganglia using knock-in
or BAC transgenic mice to drive Cre expression under gene-specific regulatory
elements. Cre knock-in mice for choline acetyltransferase (ChAT) and Slc32a1
(vesicular GABA transporter, also known as Vgat) were provided by B. Lowell
(Beth Israel Deaconess Medical Center) and are available from the Jackson Labs
(ChAT^i-Cre, stock no. 006410; Vgat^i-Cre, stock no. 016962). Gad2^i-Cre mice were purchased
from Jackson Labs (stock no. 010802). All knock-in mice link Cre expression to
the gene of interest using an internal ribosome entry site. Targeting Cre expression
in dSPNs was achieved with BAC transgenic mice expressing Cre under control of
the dopamine receptor 1 (Drd1a) or in iSPNs with Cre under control of the adenosine
2A receptor (Adora2a) or dopamine 2 receptor regulatory elements and
obtained from GENSAT (Drd1a-Cre, founder EY262, stock no. 017264-UCD;
Adora2a-Cre, founder KG139, stock no. 031168-UCD; Drd2-Cre, founder ER43,
stock no. 017268-UCD). The ChAT-GFP BAC transgenic line used to identify
ChAT+ neurons was purchased from Jackson Labs (no. 007902). To visualize the
full processes of Cre-expressing cells, Cre mice were bred to incorporate a Cre-activated
tdTomato reporter allele (Ai14; Jackson Labs, stock no. 007914; referred
to as Rosa26^lsl-tdTomato). To visualize the somata of Cre-expressing cells, the Cre-activated
ZsGreen reporter allele was used (Ai6; Jackson Labs, stock no. 007906;
referred to as Rosa26^lsl-ZsGreen). To target channelrhodopsin-2 (ChR2) to all Cre-expressing
cells, Cre driver mice were bred to incorporate a Cre-activated ChR2(H134R)-eYFP
transgene (Ai32; Jackson Labs, stock no. 012569; referred to as Rosa26^lsl-ChR2-eYFP).
In experiments designed to identify cortical cells neighbouring ChR2+ ChAT+ or
ChAT− GP-FC axons as pyramidal neurons or interneurons, ChAT^i-Cre mice also carried a
Gad1^GFP knock-in allele to highlight a subset of cortical interneurons synthesizing
GABA. Unless otherwise noted, we do not distinguish between mice heterozygous
or homozygous for knock-in alleles. Wild-type mice refer to C57BL/6 obtained
from Charles River. Transgenic mice were of a mixed genetic background. All experimental
manipulations were performed in accordance with protocols approved
by the Harvard Standing Committee on Animal Care following guidelines described
in the US National Institutes of Health Guide for the Care and Use of Laboratory
Animals.

Virus preparation. Cre-On or Cre-Off conditional expression was achieved using 
recombinant adeno-associated virus (rAAV) carrying transgenic cassettes whose 
transcription was activated or inactivated by Cre. Cre-On conditional expression
of channelrhodopsin-2 (ChR2-mCherry, H134R variant), eGFP, mCherry, or 
synaptophysin-mCherry was achieved by using a double-floxed inverted open read- 
ing frame (DIO). Cre-Off conditional expression of ChR2-mCherry was achieved 
by starting the open reading frame in the non-inverted orientation with respect to 
the promoter (DO). To achieve simultaneous Cre-On eGFP and Cre-Off tdTomato 
labelling, DIO-eGFP was mixed 1:1 with FAS-tdTomato, an alternative Cre-Off 
rAAV backbone that achieves Cre-Off expression through excision of the open 
reading frame using alternative loxP sites. DIO, DO and FAS rAAVs all use the EF1α
promoter and were packaged in serotype 8 by a commercial vector core facility, 
except for DIO-ArchT-tdTomato which uses a CAG promoter and was packaged 
as serotype 5 (University of North Carolina). All rAAVs were stored in undiluted 
aliquots at a concentration >10'* genomic copies per ml at —80 °C until intra- 
cranial injection. 

Stereotaxic intracranial injections. Male and female mice (postnatal day (P) 20- 
120) were anaesthetized with isoflurane and placed in a small animal stereotaxic 
frame (David Kopf Instruments). Under aseptic conditions, the skull was exposed 
and a small hole was drilled. For rAAV injections, 200-350 nl total volume was
delivered bilaterally into the ventral GP/dorsal nucleus basalis or 500 nl into dorsal
striatum through a pulled glass pipette at a rate of 200 nl min⁻¹ using a Microinject
system (World Precision Instruments). GP injection coordinates were 0.7 mm posterior
from Bregma, 2.0 mm lateral and 3.8 mm below the pia. Dorsal striatum injection
coordinates were 0.9 mm anterior from Bregma, 2.2 mm lateral and 2.5 mm
below the pia. After surgical procedures, mice received flunixin for analgesia and
were returned to their home cage for >21 days to allow for maximal gene expression.
To identify GP neurons that project to frontal cortex, 200 nl of fluorescent
retrobeads (Red-1X or Green, Lumafluor) were injected into frontal (anterior to 
striatum) cortical areas including secondary motor (M2), primary motor (M1), pri- 
mary somatosensory (S1) and dorsal and ventral agranular insular (AID and AIV) 
cortices. Frontal cortex injection coordinates were 1.9 mm anterior from Bregma, 
1.8 mm lateral and 2 mm below the pia. Following surgery, mice received flunixin 
and were returned to their home cage for 3-9 days before experimentation. Stereo- 
taxic coordinates were adjusted slightly by age. 


Fixed tissue preparation and imaging. Mice were deeply anaesthetized with isoflurane
and transcardially perfused with 4% paraformaldehyde (PFA) in 0.1 M sodium
phosphate buffer (1X PBS). Brains were post-fixed for 1-3 days, washed in
1X PBS and sectioned (40 µm) coronally, sagittally or horizontally using a Vibratome
(Leica). Slices were then immunostained (see Immunohistochemistry) or mounted
on slides (Super Frost). After drying, slices were coverslipped with ProLong antifade
mounting media containing DAPI (Molecular Probes) and imaged with an
Olympus VS110 slide scanning microscope using the 10X objective. Fluorescent
proteins introduced through rAAVs or transgenic alleles were never immuno-enhanced,
except in 3D brain reconstructions (see 3D brain reconstruction and
analysis). Confocal images (1-2 µm optical sections) were acquired with an Olympus
FV1000 laser scanning confocal microscope (Harvard Neurobiology Imaging
Facility) through a 63X objective.

Immunohistochemistry. Immunohistochemistry conditions were the same for both 
mouse and macaque sections. For ChAT immunohistochemistry, slices were incu- 
bated in a 1X PBS blocking solution containing 5% normal horse serum and 0.3% 
Triton X-100 for 1 h at room temperature. Slices were then incubated overnight at 
4 °C in the same solution containing anti-choline acetyltransferase antibody (1:100,
Millipore AB144P). The next morning, sections were washed three times for 5 min 
in 1X PBS then incubated for 1h at room temperature in the blocking solution 
containing donkey anti-goat Alexa 647 or Alexa 488 (1:500, Molecular Probes). For 
macaque sections, streptavidin conjugated to Alexa 350 or Alexa 488 (1:1,000, 
Molecular Probes) was also included in the secondary reaction to visualize bioti- 
nylated dextran amine (BDA) signal. The same protocol was used for NeuN (1:100, 
Millipore MAB377) and Parvalbumin (1:1,000, Millipore MAB1572) immuno- 
staining with anti-mouse Alexa 647 secondary antibodies (1:500, Molecular Probes). 
Immunostained mouse sections were mounted and imaged as described above. 
Immunostained macaque sections were mounted as described below. 

Retrograde tracing in a rhesus macaque. A 10-year-old male rhesus macaque was
prepared for surgery under aseptic conditions. Anaesthesia was initiated with ketamine
(15 mg kg⁻¹) and valium (1 mg kg⁻¹) and the macaque was given an intravenous
catheter and intubated. Isoflurane (1-2% in oxygen) was used to maintain
anaesthesia. Bilateral circular craniotomies were made over the frontal cortex, exposing
the principal and arcuate gyri. Tracer injections were targeted to cortical
areas that receive projections from the 'Ch4id' and 'Ch4iv' cell groups. Specifically,
tracers were injected along the principal and arcuate gyri (corresponding to the area
between Mesulam et al. cases 23, 26 and 6) as well as the ventral orbital frontal
cortex (Mesulam case 19) using a micromanipulator to guide a 10 µl Hamilton
syringe. In the right hemisphere, Red 1X retrobeads (Lumafluor) were injected at
16 dorsal locations and 4 ventral locations. In the left hemisphere, BDA (10% in
sterile saline) was injected at 12 dorsal locations and 2 ventral locations. At each
site, 0.5 µl of tracer was injected at two depths, 1 and 2 mm below the cortical surface.
After injection, the skull fragments were replaced and the macaque was allowed
to recover on a water-heated pad under constant observation. After a 21-day survival
period, the macaque was killed with a barbiturate overdose (>50 mg kg⁻¹ to
effect) and perfused through the heart with normal saline followed by 4% formaldehyde
in 1X PBS, pH 7.4. The brain was removed from the skull and post-fixed
for 24 h in the same fixative solution. Following cryoprotection for 3 days in 30%
sucrose solution, the hemispheres were separated, blocked and cut in 40 µm coronal
sections using a freezing microtome. Throughout the extent of GP, coronal
slices from both hemispheres were sampled at a rate of 1 out of 12 to check for
retrograde labelling. To visualize BDA labelling in the left hemisphere, slices were
rinsed in 1X PBS and immunostained (see Immunohistochemistry). Slices from
the retrobead-injected right hemisphere were unenhanced. Slices were mounted 
on gelatin-covered slides using an acetone-xylene drying procedure then cover- 
slipped with DPX mounting medium. Sampled sections from the right hemisphere 
showed non-specific fluorescent microsphere labelling throughout subcortical areas 
and thus were not considered further. Sampled sections from left hemisphere con- 
tained cells with retrograde labelling in the GP/nucleus basalis area consistent with 
Ch4iv and Ch4id groups. Flanking sections to those showing retrograde labelling 
were then double immunostained for ChAT and BDA (see Immunohistochemistry) 
and imaged as described above. All experimental manipulations were performed 
in accordance with protocols approved by the Harvard Standing Committee on 
Animal Care following guidelines described in the US National Institutes of Health 
Guide for the Care and Use of Laboratory Animals. 

Slice preparation. Acute brain slices were obtained from mice using standard techniques.
Mice were anaesthetized by isoflurane inhalation and perfused through the
heart with ice-cold artificial cerebrospinal fluid (ACSF) containing 125 mM NaCl,
2.5 mM KCl, 25 mM NaHCO3, 2 mM CaCl2, 1 mM MgCl2, 1.25 mM NaH2PO4
and 11 mM glucose (~308 mOsm per kg). Cerebral hemispheres were removed,
placed in ice-cold choline-based cutting solution containing 110 mM choline chloride, 25 mM
NaHCO3, 2.5 mM KCl, 7 mM MgCl2, 0.5 mM CaCl2, 1.25 mM NaH2PO4, 25 mM
glucose, 11.6 mM ascorbic acid and 3.1 mM pyruvic acid, blocked and transferred
into a slicing chamber containing ice-cold choline-based cutting solution. Sagittal
slices (350 µm thick) were cut with a Leica VT1000S vibratome and transferred to
a holding chamber containing ACSF at 34 °C for 30 min and then subsequently
incubated at room temperature. Both cutting solution and ACSF were constantly
bubbled with 95% O2/5% CO2. In a subset of experiments, acute brain slices were
cut in ice-cold ACSF.

Acute slice electrophysiology and two-photon imaging. Individual slices were
transferred to a recording chamber mounted on a custom-built two-photon laser
scanning microscope (Olympus BX51WI) equipped for whole-cell patch-clamp
recordings and optogenetic stimulation. Slices were continuously superfused
(3.5-4.5 ml min⁻¹) with ACSF warmed to 32-34 °C through a feedback-controlled heater
(TC-324B; Warner Instruments). Cells were visualized through a water-immersion
60X objective using differential interference contrast (DIC) illumination. Epifluorescence
illumination was used to identify those cells labelled by fluorescent microspheres
and/or expressing fluorescent genetic markers. Patch pipettes (2-4 MΩ)
pulled from borosilicate glass (G150F-3, Warner Instruments) were filled either
with a Cs+-based low-Cl− internal solution containing 135 mM CsMeSO3, 10 mM
HEPES, 1 mM EGTA, 3.3 mM QX-314 (Cl− salt), 4 mM Mg-ATP, 0.3 mM Na-GTP
and 8 mM Na2-phosphocreatine (pH 7.3 adjusted with CsOH; 295 mOsm per kg)
for voltage-clamp recordings, or with a K+-based low-Cl− internal solution composed
of 135 mM KMeSO3, 3 mM KCl, 10 mM HEPES, 1 mM EGTA, 0.1 mM
CaCl2, 4 mM Mg-ATP, 0.3 mM Na-GTP and 8 mM Na2-phosphocreatine (pH 7.3
adjusted with KOH; 295 mOsm per kg) for current-clamp recordings. Alexa Fluor
594 (20 µM) was added to both internals. Series resistance (<25 MΩ) was measured
with a 5 mV hyperpolarizing pulse in voltage clamp and left uncompensated. Membrane
potentials were corrected for a ~7 mV liquid junction potential. After the
recording was complete, cellular morphology was captured in a volume stack using
740 nm two-photon laser light (Coherent). All recorded GP-FC neurons were labelled
with microspheres following injection in frontal cortex. In experiments where
ChAT expression was not marked fluorescently, ChAT+ or ChAT− GP-FC neurons
were distinguished based on soma size and spontaneous firing rate in cell-attached
mode. Cortical neurons were classified as pyramidal neurons or interneurons based
on dendritic morphology and Gad1^GFP expression. For analyses of intrinsic properties
shown in Fig. 1f and Extended Data Fig. 5a-f, mice were aged P18-22. For
whole-cell spontaneous firing rates shown in Fig. 1g, all ages assayed (P13-56) were
included. For all analyses of intrinsic properties, NBQX (10 µM), CPP (10 µM) and
SR95531 (50 µM) were included in the bath. For pharmacological analyses of synaptic
transmission in Fig. 3a, b, mice were aged P66-127. For optogenetic activation experiments
of ChAT− cells, NBQX (10 µM) and CPP (10 µM) were included in the
bath. For ChAT+ cells, the bath solution was drug-free. IPSCs evoked from ChAT−
cells (V_hold = 0 mV) were blocked by the voltage-gated sodium channel blocker
tetrodotoxin (TTX, 1 µM), demonstrating action potential-dependent ChR2-mediated
transmitter release. IPSCs could be rescued in the continued presence of TTX by
enhancing ChR2-mediated depolarization of terminals with application of the
voltage-gated K+ channel blocker 4-aminopyridine (4AP, 500 µM) and were subsequently
abolished in the presence of the GABA_A receptor antagonist SR95531
(50 µM), indicating direct (monosynaptic) release of GABA by ChAT− cells. For
the ChAT+ cell type, evoked EPSCs (V_hold = −70 mV) were unaffected by application
of CPP and NBQX, but abolished by a nicotinic receptor antagonist cocktail
of MEC (10 µM), MLA (0.1 µM) and DHβE (10 µM), indicative of ACh release and
activation of ionotropic nicotinic receptors. IPSCs evoked from ChAT+ activation
were blocked by SR95531, but not by CPP, NBQX, MEC, MLA or DHβE, indicating
GABA_A receptor activation independent of glutamatergic and nicotinic signalling.
ChAT+ IPSCs were blocked by TTX and rescued by 4AP, confirming direct
release of GABA by ChAT+ cells. For the screen of cortical synaptic connectivity
reported in Extended Data Fig. 8b, NBQX and CPP were included in the bath for
both ChAT+ and ChAT− GP-FC cell experiments. In a subset of these experiments,
TTX and 4AP were also included. See Extended Data Fig. 8c, d for comparison.
For FC layer 1 experiments involving optogenetic activation of all ChAT^i-Cre-expressing
neurons reported in Extended Data Fig. 7k, l, mice were aged P27-91 and
NBQX and CPP were included in the bath. For optogenetic experiments involving
iSPNs or dSPNs reported in Fig. 4b, c and Extended Data Fig. 10f-h, mice were aged
P30-37. NBQX, CPP, scopolamine (10 µM), CGP55845 (5 µM) and AM251 (10 µM)
were included in the bath and quinpirole (8 µM) and sulpiride (10 µM) were used
for flow-ins. For experiments involving optogenetic activation of striatum reported
in Extended Data Fig. 10e, mice were aged P43-45 and NBQX and CPP were included
in the bath. For experiments involving electrical stimulation of the STN-GP axons
reported in Extended Data Fig. 9d, e, mice were aged P38-42 and SR95531 was
included in the bath.

Acute slice data acquisition and analysis. Membrane currents and potentials were
recorded using an Axoclamp 700B amplifier (Molecular Devices) filtered at 3 kHz
and digitized at 10 kHz using National Instruments acquisition boards and a custom
version of ScanImage written in MATLAB (Mathworks). Electrophysiology
and imaging data were analysed offline using Igor Pro (Wavemetrics), ImageJ (NIH),
MATLAB (Mathworks) and GraphPad Prism (GraphPad Software). In figures,
voltage-clamp traces represent the average waveform of 3-6 acquisitions;
current-clamp traces are individual acquisitions. Passive membrane properties were
calculated from current deflections in voltage clamp (Vhold = −70 mV). Cells were
considered spontaneously active if they maintained action potential firing (>20 s)
within 2 min of whole-cell break-in. Average Vrest was calculated for
non-spontaneously active cells 1-3 min after break-in. Peak amplitudes were calculated by
averaging over a 1 ms window around the peak. In Extended Data Fig. 9d, e, AMPAR
and NMDAR currents were isolated from the stimulation artefact by subtracting the
NBQX-resistant component (Vhold = −70 mV) followed by the CPP/NBQX-resistant
component (Vhold = +40 mV) following a 3 min wash-in period from current
averages consisting of 10-15 consecutive acquisitions (20 s inter-stimulus interval).
For pharmacological analyses in Fig. 4c, current averages were calculated from 15
consecutive acquisitions (20 s inter-stimulus interval) before and after a 3 min
wash-in period and then normalized to averages corresponding to the same time with no
drug flow-in. For pharmacological analyses in Fig. 3a, b, 3-7 consecutive acquisitions
(20 s inter-stimulus interval) were averaged following a 3 min wash-in period
for NBQX and CPP or a 4 min wash-in period for MEC, MLA and DHβE. For TTX
and 4AP conditions, current averages were composed of the acquisitions following
full block or first recovery of ChR2-evoked currents, respectively. Current responses
reported in Extended Data Fig. 8b were considered monosynaptic if they were present
in TTX/4AP or had an onset latency of <3.1 ms. Data (reported in text and figures as
mean ± s.e.m.) were compared statistically using the Mann-Whitney test or Fisher’s
exact test. P values smaller than 0.05 were considered statistically significant.
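
The peak-amplitude and monosynaptic criteria above can be illustrated with a short Python sketch. This is not the analysis code used in the study (which was run in Igor Pro and MATLAB); the function names are hypothetical, and the window is assumed to be symmetric about the sample of largest absolute deflection, which at 10 kHz gives 11 samples for a nominal 1 ms window.

```python
from statistics import mean

SAMPLE_RATE_HZ = 10_000  # digitization rate stated in the text


def peak_amplitude(trace_pa, window_ms=1.0, sample_rate_hz=SAMPLE_RATE_HZ):
    """Mean current in a window centred on the largest absolute deflection.

    Assumption: the window is symmetric about the peak sample and is
    truncated at the edges of the trace.
    """
    peak_index = max(range(len(trace_pa)), key=lambda i: abs(trace_pa[i]))
    half_width = int(round(window_ms * sample_rate_hz / 1000 / 2))  # samples per side
    start = max(0, peak_index - half_width)
    stop = min(len(trace_pa), peak_index + half_width + 1)
    return mean(trace_pa[start:stop])


def is_monosynaptic(onset_latency_ms, present_in_ttx_4ap, latency_cutoff_ms=3.1):
    """Apply the stated criterion: present in TTX/4AP, or onset latency < 3.1 ms."""
    return present_in_ttx_4ap or onset_latency_ms < latency_cutoff_ms


if __name__ == "__main__":
    # Synthetic inward current: a -100 pA plateau with a -120 pA peak sample.
    trace = [0.0] * 100
    for i in range(40, 61):
        trace[i] = -100.0
    trace[50] = -120.0
    print(peak_amplitude(trace))
    print(is_monosynaptic(2.5, present_in_ttx_4ap=False))
```

Averaging around the peak rather than reading a single sample reduces the influence of high-frequency noise on the reported amplitude.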

Optogenetic and electrical stimulation in acute slices. To activate ChR2 in acute
slices, 473 nm laser light (Optoengine) was focused onto the back aperture of the
60× water-immersion objective to produce collimated whole-field illumination.
Square pulses of laser light were delivered every 20 s and power was quantified for
each stimulation by measuring light diverted to a focal plane calibrated photodiode
through a low-pass dichroic filter. For ChR2 introduced with rAAVs, light (2 ms;
1.3-4.4 mW per mm²) was used across conditions except in some cases following
bath application of TTX and 4AP where increasing the power or duration of light
stimulation was necessary to recover currents (for example, changing the duration
from 2 to 4 ms). For ChR2 activation of dSPN or iSPN inputs onto GP-FC cells
(Drd1a-Cre;Rosa26lsl-ChR2-EYFP/+ or Adora2a-Cre;Rosa26lsl-ChR2-EYFP/+ mice), a
consistent light stimulation (1 ms; 1.3 mW per mm²) was delivered directly over the
recorded cell and the resulting currents were used to compare synaptic strength
across cells. For pharmacological analysis reported in Fig. 4c and Extended Data
Fig. 10h and paired-pulse comparisons reported in Extended Data Fig. 10f, g, the
objective was moved 0.16-1.4 mm into dorsal striatum (median = 0.4 mm) and
stimulation strength and duration (0.5-1 ms; 0.06-4.4 mW per mm²) were adjusted
to produce first-peak currents between 26 and 547 pA (median = 226 pA). Stronger
light powers (2-7 ms; 4.4 mW per mm²) were used to activate ChR2 in ChATi-Cre cells
(ChATi-Cre;Rosa26lsl-ChR2-EYFP/+ mice) in cortex (Extended Data Fig. 7k, l). For
electrical activation of the STN axonal projection into GP, a bipolar tungsten electrode
(TST33A10KT; World Precision Instruments) was placed at the anterior border of
the STN and a 0.1-0.5 ms square pulse of current was applied and power adjusted to
maintain evoked currents while minimizing the stimulus artefact.

Reagents. Drugs (all from Tocris) were applied via bath perfusion: SR95531
(10 µM), TTX (1 µM), 4AP (500 µM), scopolamine (10 µM),
(2S)-3-[[(1S)-1-(3,4-Dichlorophenyl)ethyl]amino-2-hydroxypropyl](phenylmethyl)phosphinic acid
(CGP55845; 5 µM),
N-(Piperidin-1-yl)-5-(4-iodophenyl)-1-(2,4-dichlorophenyl)-4-methyl-1H-pyrazole-3-carboxamide
(AM-251; 10 µM), 2,3-dihydroxy-6-nitro-7-sulfamoyl-benzo(f)quinoxaline (NBQX; 10 µM),
R,S-3-(2-carboxypiperazin-4-yl)propyl-1-phosphonic acid (CPP; 10 µM),
N,2,3,3-Tetramethylbicyclo[2.2.1]heptan-2-amine (MEC; 10 µM),
[1α,4(S),6β,14α,16β]-20-Ethyl-1,6,14,16-tetramethoxy-4-[[[2-(3-methyl-2,5-dioxo-1-pyrrolidinyl)benzoyl]oxy]methyl]aconitane-7,8-diol
(MLA; 0.1 µM),
(2S,13bS)-2-Methoxy-2,3,5,6,8,9,10,13-octahydro-1H,12H-benzo[i]pyrano[3,4-g]indolizin-12-one
(DHβE; 10 µM),
(S)-(-)-5-Aminosulfonyl-N-[(1-ethyl-2-pyrrolidinyl)methyl]-2-methoxybenzamide
((-)-sulpiride; 10 µM) and
(4aR-trans)-4,4a,5,6,7,8,8a,9-Octahydro-5-propyl-1H-pyrazolo[3,4-g]quinoline
((-)-quinpirole; 8 µM). CPP and NBQX were combined to make a cocktail of antagonists
to target ionotropic glutamate receptors, while MEC, MLA and DHβE were
combined to make a cocktail to antagonize nicotinic receptors.

Biocytin labelling of STN axonal projections. Acute parasagittal slices (350 µm
thick, 10° off-sagittal) were prepared from wild-type mice (P34) as described above.
This cutting orientation preserves the reciprocal connections between the STN and
GP. In circulating warm ACSF (32-34 °C), patch pipettes filled with biocytin-containing
(0.2%, Molecular Probes) internal solution (135 mM KMeSO3, 5 mM
KCl, 5 mM HEPES, 4 mM Mg-ATP, 0.3 mM Na2-phosphocreatine; pH 7.4 adjusted
using KOH; 295 mOsm per kg) were targeted to the STN under DIC illumination. A
picospritzer (Picospritzer III; Parker Instrumentation) was used to puff the solution
into the centre of STN for 1 h (400 ms pulse at 1 Hz, 5-10 p.s.i.). The slice was allowed
to recover for 0.5 h before being transferred to 4% PFA in 1× PBS overnight. The
following morning, slices were rinsed in 1× PBS before avidin-biotinylated HRP
complex (ABC) processing (Vectastain). Slices were then wet-mounted onto glass
slides, coverslipped and imaged under bright-field illumination using the 10×
objective of an Olympus VS110 slide scanner microscope.

In vivo electrophysiology. At least one week after the initial injections, male mice
were surgically implanted with a permanent titanium headpost. In this surgery the
coordinates for GP, FC and M1 were marked on the surface of the skull based on
stereotaxic coordinates. The headpost was secured and the animal’s skull was covered
with C&B Metabond (Parkell Inc.). Animals were subsequently housed alone
and allowed to recover for one week before habituation to restraint. Mice were
water restricted (to a target of 85% of their free feeding weight) and habituated to
restraint in a custom-made lever-press training rig. This rig allows the animal to
press a lever in response to an auditory cue. Animals were trained for increasing
durations for at least 3 days until they were able to tolerate head restraint for 90 min
without struggling. In this time animals achieved an intermediate level of performance
on the task, where they knew the association between lever press and reward,
but were not proficient at recognizing and responding to tones. One day before the
first recording day animals were anaesthetized with isoflurane and received a
craniotomy over the region of interest. In this surgery, if necessary, a chronic fibre
(62.5 µm core multimode fibre (http://www.ecablemart.com) attached to a ceramic
LC ferrule (Pfp Inc.)) was implanted at the same coordinates as the GP viral injection,
but at a depth of 2.8 mm. All chronic fibres were pre-screened to have >80%
transmission at 473 nm light. A ceramic ferrule connecter linked the chronic fibre
to a standard LC cable. Light shuttering and output control was through an
acousto-optic modulator (AA systems) and had a >1,000:1 occlusion ratio. Recordings were
made using 16- or 32-channel silicon probes with 177 µm² recording sites
(Neuronexus Technologies) spaced 50 µm apart and lowered to a depth of ~1,000 µm
below the surface of the brain. Optrode recordings were made with a 16-channel
probe of the same configuration fitted with an optical fibre 100 µm above the top
recording site (Neuronexus Technologies, OA series). Stimulation intensity for
blue-light fibre stimulation was 5 mW from fibre tip (~400 mW per mm²), optrode
stimulation was 10 mW from fibre tip (~800 mW per mm²) and yellow-light fibre
stimulation was 3 mW from fibre tip (~250 mW per mm²). All in vivo electrophysiology
was acquired using the OmniPlex system (Plexon Inc.) and filtered at 300 Hz to
8 kHz. Spike detection was done by level crossing, generally at 50 µV, and clustering
to remove the noise cluster using an offline sorter (Plexon Inc.). Units that were
separable were counted separately, but many units were accepted as multi-unit. All
analyses here treat each unit as a possible multi-unit; units were separated only when
necessary. Units with firing rate <0.1 Hz in the baseline period were excluded from
analysis. All analysis was performed using custom scripts in Igor Pro. Data
visualization and statistical analysis was done using Igor Pro and GraphPad Prism.
Index of Modulation (Ilight) was calculated as the firing rate with the light on
minus the firing rate with the light off, divided by the sum of those firing rates;
it varies from −1 to 1, where 0 indicates no change and 1 indicates that all activity was
detected with the light on. Latency analysis was carried out similarly to other groups:
the firing rate for each unit was binned in 50-ms periods aligned to the onset of
optogenetic stimulation. The mean and standard deviation of the firing rate was
determined for a baseline period one second long ending with the light onset. The first bin
after light onset to deviate more than ±2 s.d. from the baseline mean was reported
as the bin of first change. Units where no change was detected within the first
500 ms were ignored. By this metric units can have both an increasing and decreasing
first bin irrespective of the net change in firing rate. The z score of
millisecond-timescale firing-rate changes was calculated by obtaining a mean and standard
deviation of firing in 20 × 5 ms bins from a baseline period 3 s before each 5 ms blue
pulse. This baseline period had the same number of pulses as the actual stimulation
trial (4,500-6,000 pulses) but was shifted to a period of no illumination. The actual
firing rate was z scored (mean subtracted and divided by the s.d.) by this baseline
mean and s.d., to find significant deviations. z scores from each unit were averaged
to create a population value (Fig. 2c and Extended Data Fig. 6g). All mCherry control
experiments were performed in ChATi-Cre mice following transduction of rAAV
DIO-mCherry in the GP. All ArchT experiments were performed in Vgati-Cre or
ChATi-Cre mice following transduction of rAAV DIO-ArchT-tdTomato into the
GP.
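
As an illustration of the unit-level metrics described above, the following Python sketch computes the index of modulation and the bin of first change from pre-binned firing rates. It is not the Igor Pro code used in the study; all function and argument names are hypothetical, the baseline s.d. is assumed to be the sample standard deviation (the text does not specify), and each direction of change is searched separately so that a biphasic unit can yield both a first increasing and a first decreasing bin.

```python
from statistics import mean, stdev


def modulation_index(rate_on_hz, rate_off_hz):
    """(on - off) / (on + off): -1 to 1, with 0 meaning no change."""
    total = rate_on_hz + rate_off_hz
    if total == 0:
        raise ValueError("unit is silent in both conditions")
    return (rate_on_hz - rate_off_hz) / total


def first_change_bin(baseline_rates, post_rates, direction, n_sd=2.0, max_bins=10):
    """Index of the first post-onset bin deviating > n_sd s.d. from baseline.

    baseline_rates: rates in 50-ms bins over the 1 s before light onset.
    post_rates:     rates in 50-ms bins starting at light onset (bin 0 = 0-50 ms).
    direction:      +1 for increases, -1 for decreases.
    max_bins:       10 bins = 500 ms; returns None if no change is found.
    """
    baseline_mean = mean(baseline_rates)
    threshold = n_sd * stdev(baseline_rates)  # assumption: sample s.d.
    for index, rate in enumerate(post_rates[:max_bins]):
        if direction * (rate - baseline_mean) > threshold:
            return index
    return None


def z_score(rate, baseline_rates):
    """Mean-subtract and divide by the baseline s.d., as described for Fig. 2c."""
    return (rate - mean(baseline_rates)) / stdev(baseline_rates)


if __name__ == "__main__":
    baseline = [10, 12, 8, 10, 11, 9]
    post = [10, 11, 14, 5]
    print(modulation_index(15.0, 5.0))
    print(first_change_bin(baseline, post, direction=+1))
    print(first_change_bin(baseline, post, direction=-1))
```

Returning `None` when no bin crosses threshold within 500 ms mirrors the exclusion rule stated in the text.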

Three-dimensional brain reconstruction and analysis. Following fixation, brains
were frozen and sectioned at 50 µm. GFP and tdTomato were immuno-enhanced in
free-floating sections using mouse anti-GFP (1:1,000, Abcam ab1218) and rabbit
anti-RFP (1:2,000, Rockland Antibodies 600-401-379) followed by anti-mouse Alexa
488 (1:200, Jackson Immunoresearch 115-545-062) and anti-rabbit Cy3 (1:2,000;
Jackson Immunoresearch 111-165-144) secondary antibodies. Sections were mounted
on slides and counterstained with Neurotrace Blue (Invitrogen) and imaged on a
Zeiss microscope with a Ludl motorized stage controlled with Neurolucida software
(Microbrightfield). Imaging was performed with a 10× objective and a Hamamatsu
Orca Flash 4 camera. Each coronal section contained 80-200 tiles, which were merged
with Neurolucida software. Coronal sections were aligned and Nissl labelling
normalized using Neurolucida, Adobe Photoshop and ImageJ software. Aligned sections
were rendered in three dimensions and cortical areas defined using Imaris
software (Bitplane). Custom algorithms were written in MATLAB (Mathworks) to
detect and quantify axons by cortical area. In brief, 2D multiscale Hessian filtering
was followed by non-maximum suppression and then by hysteresis thresholding.
Hysteresis thresholds were applied to both the response (largest eigenvalue of the
Hessian) and anisotropy (difference of eigenvalues after normalizing by Gaussian filter
response). For anterior-posterior analysis of axon densities (Fig. 1c), raw data were
five-point median filtered and peak normalized. For average fluorescence measurements
across layers (Fig. 1d), the fluorescence signal from each channel was
normalized to peak and baseline subtracted by the white matter signal.
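
Two of the processing steps named above, hysteresis thresholding and five-point median filtering with peak normalization, can be sketched in plain Python. This is an illustrative sketch only and not the MATLAB code used in the study: function names are hypothetical, the hysteresis step is shown on a single response map with 8-connectivity (the text applies thresholds to both response and anisotropy and does not state the connectivity), and the median window is assumed to shrink at the ends of the series.

```python
from collections import deque
from statistics import median


def hysteresis_threshold(image, low, high):
    """Keep pixels >= high, plus pixels >= low connected to them (8-connectivity)."""
    rows, cols = len(image), len(image[0])
    keep = [[False] * cols for _ in range(rows)]
    queue = deque()
    for r in range(rows):
        for c in range(cols):
            if image[r][c] >= high:  # strong pixels seed the search
                keep[r][c] = True
                queue.append((r, c))
    while queue:
        r, c = queue.popleft()
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                nr, nc = r + dr, c + dc
                inside = 0 <= nr < rows and 0 <= nc < cols
                if inside and not keep[nr][nc] and image[nr][nc] >= low:
                    keep[nr][nc] = True  # weak pixel attached to a kept pixel
                    queue.append((nr, nc))
    return keep


def median_filter(values, width=5):
    """Running median; the window is truncated at the edges (an assumption)."""
    half = width // 2
    return [median(values[max(0, i - half): i + half + 1]) for i in range(len(values))]


def peak_normalize(values):
    """Scale a series so that its maximum equals 1."""
    peak = max(values)
    if peak == 0:
        raise ValueError("cannot normalize an all-zero series")
    return [v / peak for v in values]


if __name__ == "__main__":
    print(hysteresis_threshold([[0.9, 0.5, 0.1, 0.5]], low=0.4, high=0.8))
    densities = [0, 1, 100, 2, 3, 4, 2]  # one outlier section
    print(peak_normalize(median_filter(densities)))
```

The median filter suppresses single-section outliers in the anterior-posterior density profile without smearing them into neighbouring sections, which a running mean would do.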

Array tomography. Mice were deeply anaesthetized with isoflurane and transcardially
perfused with 4% PFA, 2% sucrose in 0.1 M sodium phosphate buffer (1×
PBS). Brains were post-fixed for one day, washed in 1× PBS and sectioned into
300-µm sagittal slabs using a Vibratome (Leica). Frontal cortex was then cut out using
a razor blade and dehydrated through a series of alcohol dilutions before being
infiltrated with LR White acrylic resin (Sigma Aldrich L9774-500G) overnight.
Tissue was then placed in an LR White-filled gel cap that was polymerized at 50 °C
overnight. Blocks of tissue were cut on an ultramicrotome (Leica EM UC7) into
ribbons of 70-nm thin sections. Sections were then manually lifted onto
gelatin-coated slides, air dried and then heated on a hot plate (~80 °C) for 10 min. Slides
were marked by PAP pen liquid blocker. Ribbons were treated with 50 mM glycine
in 1× TBS for 5 min, followed by 5 min in blocking buffer (5% BSA, 0.05% Tween
20 in TBS) before primary antibody staining. Staining was performed in blocking
buffer overnight at 4 °C. After primary antibody staining, the slides were washed
three times with TBS, followed by blocking buffer for 5 min. Secondary antibody
staining was performed in blocking buffer for 30 min at room temperature. All
secondary antibodies were used at a dilution of 1:150. Slides were washed with TBS
and then rinsed with ultra-pure H2O for 20 s. SlowFade Gold antifade reagent with
DAPI (Molecular Probes S36939) was added to each slide before coverslipping.
After imaging, the coverslip was removed and the slide was treated with elution
buffer (0.2 M NaOH, 0.1% SDS in dH2O) for 20 min at room temperature. The
slide was washed with TBS three times and rinsed with ultra-pure H2O for 20 s.
After rinsing, each slide was allowed to air dry, then put on a hot plate for 10 min
before staining. Subsequent stains followed the same protocol. The dilutions and
staining order are as follows: Stain 1: chkαGFP 1:100 (GTX13970, GeneTex);
musαGephyrin 1:100 (612632, Biosciences Pharmingen); rabαGAD 65-67 1:1,000
(ab11070, Abcam); Stain 2: rabαPSD95 1:100 (3450, Cell Signaling Tech.); Stain 3:
musαBassoon 1:100 (ab82958, Abcam); rabαSynapsin 1 1:100 (5297S, Cell Signaling
Tech.); Stain 4: rabαParvalbumin 1:100 (PV-25, Swant); Stain 5: musαVAChT
1:100 (139 103, Synaptic Systems); rabαVGAT 1:100 (131 011, Synaptic Systems).
Imaging was performed using a Zeiss Axio Imager Z1 Upright Fluorescence
Microscope. A position list was generated at 20× in cortex to identify the regions of
interest on each section. Four images were then acquired with a Zeiss Plan-Apochromat
63×/1.4 oil DIC objective and stitched into a single final image (Mosaix,
Axiovision). Individual stacks were aligned in FIJI using the MultiStackReg plugin,
initially using the DAPI channel and then a second alignment to the stack from the
first imaging session. Fine alignments were then performed using the synapsin I
stack and the Register Virtual Stack Slices plugin of FIJI to correct for warping.
Background fluorescence was then subtracted from the aligned stacks using a 10
pixel rolling ball filter and contrast was adjusted to 0.1% through the FIJI software.
Image analyses were carried out with custom-written scripts in MATLAB
(Mathworks). GFP+ volumes and diffraction-limited synaptic markers were
computationally detected from image stacks. GFP+ ‘pearls’, which correspond to putative
presynaptic terminals (PSTs), were identified as varicosities belonging to a ‘string’
with multi-plane-spanning volumes. All synaptic markers were treated as single
pixel point sources within a given z plane. Images were segmented to exclude DAPI+
nuclei and areas containing no tissue. Co-localization analyses of GFP+ pearls and
synaptic markers were performed by quantifying the mean voxel densities within
GFP+ pearls (0 distance) and from 102-512 nm outside pearl volumes. Mean voxel
densities for real data were compared to those mean densities (Extended Data Fig. 8i)
following randomization of diffraction-limited immunopunctae (1,000 rounds). z
scores (as in Fig. 3e; Extended Data Fig. 8k) were calculated from 0 distance mean
densities following 10 rounds of GFP+ pearl randomization:

$$
z\ \text{score} = \frac{\text{mean}_{\text{real}} - \text{mean}_{\text{random}}}{\sqrt{\dfrac{\sigma^{2}_{\text{real}}}{N_{\text{real}}} + \dfrac{\sigma^{2}_{\text{random}}}{N_{\text{random}}}}}
$$

In total, the array tomography data set consisted of n = 8 stacks from n = 2 mice,
with n = 4 stacks in layers 1-3 and n = 4 stacks in layer 5. Each stack consisted of
26-31 × 70 nm slices, with a total volume of 7.6 × 10° µm³. Note that the plot in
Fig. 3f, which reports the distances between VGAT-VAChT punctae within the
same GFP+ pearl, excludes n = 8 singleton values between 1.2 and 1.6 µm.
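
The z score defined above compares real and randomized mean densities using the combined standard error of the two groups. A minimal Python sketch follows; it is not the MATLAB code used in the study, the function name is hypothetical, and it assumes that the variances are sample variances and that N is the number of density values in each group, neither of which is stated in the text.

```python
from math import sqrt
from statistics import mean, variance


def colocalization_z_score(real_densities, random_densities):
    """(mean_real - mean_random) / sqrt(var_real / N_real + var_random / N_random)."""
    n_real = len(real_densities)
    n_random = len(random_densities)
    standard_error = sqrt(
        variance(real_densities) / n_real          # assumption: sample variance
        + variance(random_densities) / n_random
    )
    return (mean(real_densities) - mean(random_densities)) / standard_error


if __name__ == "__main__":
    # Made-up densities, for illustration only.
    real = [4.0, 6.0, 8.0]
    randomized = [1.0, 2.0, 3.0]
    print(colocalization_z_score(real, randomized))
```

A positive value indicates that a marker is denser within the real pearls than within the randomized ones; a negative value indicates relative exclusion.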

Code availability. Code for axon detection and array tomography co-localization 
analysis is available upon request. 

Statistical methods. No statistical methods were used to predetermine sample
size.

Extended Data Figure 1 | Anatomical and molecular properties of GP-FC
cells and ChAT+ cells of the substantia innominata and ventral pallidum.
a-c, GP-FC cells project exclusively to ipsilateral cortex. a, Low-magnification
horizontal section from a wild-type mouse injected bilaterally in FC with
red (right hemisphere) and green (left hemisphere) retrobeads. DAPI (blue),
nuclear stain. Boxed insets show location of GP in b. Green signal in the
right hemisphere is due to bleed through from the red channel. b,
High-magnification of left and right GP from the same brain as in a. Retrobead+ cells
from ipsilateral (ipsi) and contralateral (contra) injections are highlighted
with white circles. Dashed lines demarcate the approximate boundaries of the
GP. c, Summary graph showing nearly all retrobead+ cells (n = 436 of 437,
from four mice) resulted from injections in ipsilateral FC. d, e, FC retrobead
labelling in Vgati-Cre;Rosa26lsl-zsGreen/+ or Gad2i-Cre;Rosa26lsl-zsGreen/+ mice
followed by ChAT immunostaining (magenta) demonstrates that nearly all
retrobead+ GP cells (red) are Vgati-Cre-positive or Gad2i-Cre-positive (green)
while a subset of retrobead+ neurons are also ChAT+ (solid circles) and the
remainder are ChAT− (dashed circles). Nearly all retrobead+ GP cells were
Vgat+ (n = 159 of 159 cells, from three mice) or Gad2+ (n = 231 of 233 cells,
from two mice), whereas 72% were ChAT+ (n = 215 of 300 cells, from five
mice) and 28% ChAT− (n = 85 of 300 cells). NB, nucleus basalis. d, Top,
low-magnification sagittal view of the GP. Bottom, a single confocal plane from
stacks used to quantify marker co-localization in Vgati-Cre;Rosa26lsl-zsGreen/+
mice. e, A single confocal plane from stacks used to quantify marker
co-localization in Gad2i-Cre;Rosa26lsl-zsGreen/+ mice. f, g, ChAT+ neurons of the
substantia innominata (SI) and ventral pallidum (VP) also express Vgat and
Gad2. GPm, medial globus pallidus (entopeduncular nucleus). f, Top,
low-magnification ventral view of a sagittal section from a Vgati-Cre;Rosa26lsl-zsGreen/+
mouse immunostained for ChAT. Bottom, high-magnification view of
the substantia innominata and bordering ventral pallidum. g,
High-magnification view of the substantia innominata/ventral pallidum in a
Gad2i-Cre;Rosa26lsl-zsGreen/+ mouse immunostained for ChAT. h-j, ChAT−
GP-FC cells do not express parvalbumin (PV). h, Low-magnification view of
sagittal section through the GP of a wild-type mouse injected with retrobeads
(red) in FC and immunostained for ChAT (green) and parvalbumin (magenta).
White circles highlight retrobead+ GP-FC cells. i, A single confocal plane
showing retrobead+ GP-FC cells (circled in white) and immunostaining
for ChAT and parvalbumin. j, Confocal quantification of co-localization
between retrobead+ ChAT− GP-FC cells and parvalbumin (n = 32 cells,
from two mice).

Extended Data Figure 2 | ChAT+ and ChAT− GP-FC cells are present in a
rhesus macaque. a, In order to label frontal cortical projection neurons
from Ch4iv and Ch4id regions of the nucleus basalis adjacent to the GP of a
rhesus macaque, the neuronal tracer biotinylated dextran amine (BDA) was
injected at multiple sites along the arcuate and principal gyri and in the
orbital cortex. Left, dorsal (top) and ventral (bottom) views of a fixed macaque
brain. Dashed boxes indicate the injected areas. Right, schematic of the
injection sites. Blue circles correspond to 2 × 0.5 µl BDA injections at 1 and
2 mm below the pial surface. b, Coronal section through the injection area
after immunostaining to visualize BDA. c-e, Immunostaining for BDA and
ChAT identifies retrogradely labelled ChAT+ and ChAT− GP-FC cells.
c, Coronal section from macaque atlas containing GP and nucleus basalis.
d, Left, ChAT immunostaining highlights traditional anatomical boundaries of
the GP-putamen and GP-nucleus basalis. Same plane as in c. Right, higher
magnification view of GP-nucleus basalis border corresponding to the inset in
c. ChAT+ neurons are distributed around the ventral GP/dorsal nucleus basalis
and along laminae separating the GP from the putamen (lateral medullary
laminae, lml) and globus pallidus internus (medial medullary laminae, mml).
Arrow and arrowhead indicate approximate locations of BDA+ChAT+
(680 µm anterior) and BDA+ChAT− (360 µm anterior) example GP-FC cells
shown in e. e, Single confocal planes showing example BDA+ChAT+ and
BDA+ChAT− GP-FC cells. f, g, ChAT immunostaining in a Drd2-eGFP
mouse distinguishes traditional anatomical boundaries of the GP-nucleus
basalis from the territory occupied by iSPN axons. f, Coronal section from the
mouse atlas. g, Left, ChAT immunostaining highlights traditional anatomical
boundaries of the GP-striatum and GP-nucleus basalis. Same plane as in
f. Right, higher magnification view of GP-nucleus basalis border region
corresponding to the inset in f. As in macaque, ChAT+ cells are distributed
along GP borders between striatum and the internal capsule (ic) and at the
border of ventral GP-dorsal nucleus basalis. Overlay of GFP fluorescence
demonstrates iSPN axons arborize throughout the GP, abutting ChAT+ cells
on the GP border regions (arrow), and ventrally in the dorsal nucleus basalis
(arrowhead). AA, anterior amygdaloid area; ac, anterior commissure posterior;
CeL, central lateral division of amygdala; CeM, central medial division of
amygdala; GP, globus pallidus externus; GPi, globus pallidus internus; Lh,
lateral hypothalamus; Pu, putamen; SI, substantia innominata; Str, striatum.

Extended Data Figure 3 | Validation of ChATi-Cre knock-in mouse and
rAAV strategy for Cre-On/Off labelling of GP-FC axons in cortex. a-c, The
ChATi-Cre mouse expresses Cre selectively in ChAT+ GP/nucleus basalis
neurons with high penetrance. a, Left, low-magnification view of sagittal
section through ChATi-Cre;Rosa26lsl-tdTomato/+ mouse immunostained for ChAT.
Right, inset showing higher magnification view of the GP and nucleus basalis.
Dashed line approximates the boundaries for quantifying Cre-reporter and
ChAT co-localization. b, Single confocal plane showing overlap of Cre-reporter
expression and ChAT immunostaining of ChAT+ cells at the GP-nucleus
basalis border. c, Quantification of confocal co-localization between the
Cre-reporter+ and ChAT+ cells (n = 471 of 484 tdTomato+ChAT+, n = 12
of 484 tdTomato+ only, n = 1 of 484 ChAT+ only, from three mice).
d-f, Transduction of the GP in a ChATi-Cre mouse with DIO-eGFP (Cre-On)
and FAS-tdTomato (Cre-Off) rAAVs effectively targets GFP and tdTomato to
ChAT+ and ChAT− cells respectively. d, Sagittal section through the GP
following transduction of the DIO-eGFP (green) and FAS-tdTomato (red)
rAAVs and immunostaining for ChAT (magenta). e, Single confocal plane
showing ChAT+ cells (circled in white) co-localized with GFP but not
tdTomato. f, Quantification of confocal co-localization between ChAT, GFP
and tdTomato (n = 319 ChAT+ cells, from two mice). g, Left, coronal atlas.
Right, injection site for injection 1 (Fig. 1b-d), showing ChAT+ (Cre-On)
expression limited to GP and the immediately adjacent nucleus basalis. ChAT−
(Cre-Off) expression is limited to GP with slight leak in striatum. ChAT+ and
ChAT− axons arborize in the thalamic reticular nucleus (Rt). SI, substantia
innominata; NB, nucleus basalis; ic, internal capsule. h, Coronal section of
anterior M1 illustrating automated axon detection of ChAT− and ChAT+
axons. i, Densities of detected ChAT− (green) and ChAT+ (magenta) axons
along with total cortical area (black) in successive 50-µm coronal slices
(anterior-posterior) across the cortical mantle (n = 2 mice). Densities and area
are shown raw (thin line) and after five-point median filtering (thick line).
Median-filtered data are reported in Fig. 1c. j, Average density of ChAT+
and ChAT− GP-FC axons by cortical area. FrA, frontal association; PrL,
prelimbic; MO, medial orbital; LO, lateral orbital; VO, ventral orbital; GI/DI,
granular/dysgranular insular; AI, agranular insular; M1, primary motor; M2,
secondary motor; S1, primary sensory; S2, secondary sensory; Cg, cingulate;
IL, infralimbic; DLO, dorsolateral orbital; Pir, piriform; A2, secondary auditory;
A1, primary auditory; RS, retrosplenial; V1, primary visual; V2, secondary
visual; TeA, temporal association; PTA, parietal; Ent, entorhinal; Ect,
ectorhinal; PRh, perirhinal. k, Coronal section illustrating the distribution of
GP-FC axons across layers of ectorhinal cortex, a posterior cortical area that
receives a large GP-FC projection. ChAT− axons target superficial layers 1
and 2/3 (arrow), in addition to deeper layers 5 and 6, as in anterior cortices
including M1. The ChAT+ axons arborize across all cortical layers in both
ectorhinal and anterior cortical areas. rh, rhinal fissure; Str, striatum;
wm, white matter.

Extended Data Figure 4 | ChAT+ and ChAT− GP-FC cells target distinct
but overlapping subcortical nuclei. a, Coronal sections from 3D brain
reconstructions illustrating subcortical nuclei targeted by ChAT+ and ChAT−
axons following transduction of ChATi-Cre GP/dorsal nucleus basalis with
rAAVs DIO-eGFP (magenta, Cre-On) and FAS-tdTomato (green, Cre-Off)
(n = 2 mice, examples from injection 1). Left, coronal atlas. Right,
high-magnification views of subcortical nuclei.

Extended Data Figure 5 | GP-FC cells are distinguished by active and
passive membrane properties. a, Maximum intensity two-photon projections
of example ChAT+ (left) and ChAT− (right) GP-FC cells following whole-cell
recording. Dashed insets show high-magnification projections through
dendrites. b, ChAT+ cells have larger somata than ChAT− cells. Soma volumes
were quantified from two-photon stacks (from four P18-22 mice; n = 8
ChAT+ cells, n = 8 ChAT− cells). c, Passive membrane properties of GP-FC
cells (n = 9 ChAT−, n = 10 ChAT+, from four P18-22 mice). d, Representative
waveforms of spontaneously active ChAT− and ChAT+ GP-FC cells.
e, Schematic of quantified membrane properties following positive and
negative current injections. ISI, interspike interval. f, Active membrane
properties of GP-FC cells (n = 8-9 ChAT−, n = 10 ChAT+). Representative
action potential waveforms from a spontaneously active ChAT− cell or
from minimal current injection in a ChAT+ cell. Evoked firing rates were
calculated for 500-ms current injections. *P < 0.05 (Mann-Whitney test).
g, Developmental comparison of GP-FC membrane properties before (P13-14,
n = 2 mice) and around (P50-56, n = 3 mice) sexual maturity. Left, ChAT−
GP-FC cells are spontaneously active throughout postnatal development
(P13-14, n = 4 of 4; P50-56, n = 4 of 5), while ChAT+ cells tend to become
spontaneously active after sexual maturity (P13-14, n = 1 of 11; P50-56, n = 7
of 8). GP-FC membrane resistance (middle) does not change before and
after sexual maturity but membrane capacitance (right) is reduced (ChAT−:
P13-14, n = 4; P50-56, n = 5; ChAT+: P13-14, n = 9; P50-56, n = 9).
*P <(.05 (Fisher’s exact test); error bars denote mean = s.e.m. 
Extended Data Figure 6 | Optogenetic manipulations of GP-FC cells
coupled with in vivo extracellular FC recordings in an awake behaving
mouse. a, Summary plot of mean (and 95% confidence interval) indices for
light modulation (I_light) of FC unit firing rate by experimental condition.
Dashed line denotes average of blue light fibre mCherry⁺ control. The numbers
of mice for each condition are shown in parentheses. *P < 0.0001,
Kruskal-Wallis test. b, Pulsed ChR2 depolarization of Vgat^i-Cre GP somata increases
firing rates in FC on a millisecond timescale. Left, experimental schematic.
Extracellular recordings in FC are coupled with fibre-delivered pulses of blue
light (5-ms pulses of 473 nm, 10 Hz for 3 s) in GP. Middle, mean firing rate
(±s.e.m.) of all FC units in response to pulsed blue light in ChR2⁺ (blue) or
control (black) mice (ChR2⁺, n = 90 units from five mice; mCherry⁺, n = 99
units from three mice). Dotted line represents mean pre-stimulation firing rate.
Right, pseudo-coloured plot of normalized firing rate for all units.
c, Fibre-delivered pulsed blue light illumination of mCherry⁺ GP somata in ChAT^i-Cre
mice shows no light-induced changes in the firing rates of FC units above
chance (increased, 0 of 99; decreased, 2 of 99; from three mice). Right,
pseudo-coloured plot of normalized firing rate for all units. d, Fibre-delivered
constant yellow light illumination of mCherry⁺ GP somata in ChAT^i-Cre mice
shows no light-induced changes in the firing rates of FC units above chance
(increased, 1 of 63; decreased, 2 of 63; from two mice). Right, pseudo-coloured
plot of normalized firing rate for all units. e, Latencies of light-induced
modulation of FC firing following fibre-based illumination of ChR2⁺ (pulsed
blue light) or ArchT⁺ (constant yellow light) GP somata in Vgat^i-Cre mice.
Top, spike raster plots (upper) and firing rate histograms (lower, 50-ms bins)
from example FC units exhibiting light-induced decreases and increases in
firing rates. Onset times for light pulses are shown with coloured dashed lines.
Bottom, summary graph plotting light-effect latencies for those FC units with
statistically significant modulations. Firing rates are binned every 50 ms,
such that ‘Bin of first change’ = 0 contains the spikes from 0-50 ms after light
onset. First change is determined as the first bin to deviate more than ±2 s.d.
from the mean baseline firing. Units in which no change is detected within
500 ms are excluded. Individual units may have a first increasing and first
decreasing bin if their activity is biphasic. The mean 50-ms bin in which units
were first activated by ArchT in Vgat^i-Cre mice was 3.0 ± 0.5, n = 47 of 96 units;
first suppressed, 4.3 ± 0.5, n = 28 of 96 units; first activated by ChR2,
2.9 ± 0.3, n = 22 of 90 units; first suppressed by ChR2, 0.7 ± 0.2, n = 35 of 90
units. f, Optrode-delivered pulsed blue light illumination of ChR2⁺ axons in FC
from Vgat^i-Cre mice shows no persistent changes in the firing rates of FC units
above chance (increased, 1 of 111; decreased, 2 of 111; from five mice). Right,
pseudo-coloured plot of normalized firing rate for all units. g, Pulsed blue light illumination
of GP-FC axons using an optrode leads to increases in firing rate on the
millisecond timescale. Left, mean (±s.e.m.) firing rates on a millisecond
timescale of all units in response to pulsed blue light illumination of GP axons in FC
for Vgat^i-Cre mice expressing ChR2 (blue) or mCherry (black) or ChAT^i-Cre
mice expressing ChR2 (purple). Right, z score of inter-pulse interval firing rates
(20 × 5-ms bins) comparing positive and negative deviations from preceding
baseline period without light. h, Optrode-delivered pulsed blue light
illumination of ChR2⁺ axons in FC from ChAT^i-Cre mice shows no persistent
changes in the firing rates of FC units above chance (increased, 0 of 74;
decreased, 0 of 74; from three mice). Right, pseudo-coloured plot of normalized
firing rate for all units. i, Fibre-delivered constant yellow light illumination of
ArchT⁺ GP somata in ChAT^i-Cre mice shows no light-induced changes in
the firing rates of FC units above chance (increased, 1 of 120; decreased, 3 of 120;
from four mice). Right, pseudo-coloured plot of normalized firing rate for
all units. In plots of I_light, red bars indicate units that were statistically
significantly modulated by light (t-test, P < 0.05). For pseudo-coloured plots of
normalized firing rate, units are normalized to the baseline period, before
light onset, and ordered by I_light (low to high). Blues and purples represent low
firing rates whereas yellow and red represent higher firing rates. Red represents
modulations three or more times baseline.
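
The ‘Bin of first change’ rule described in e can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' analysis code: the function name `bin_of_first_change`, its parameters, and the use of the sample standard deviation of the baseline bins are hypothetical choices, since the legend specifies only the 50-ms binning, the ±2 s.d. criterion and the 500-ms exclusion window.

```python
from statistics import mean, stdev


def bin_of_first_change(baseline_rates, post_rates, n_sd=2.0,
                        bin_ms=50, window_ms=500):
    """Return (first_increase_bin, first_decrease_bin) for one unit.

    baseline_rates: firing rates in the pre-light bins (assumed input).
    post_rates: firing rates in consecutive bins after light onset,
                so index 0 covers 0-50 ms when bin_ms is 50.
    A direction with no deviation inside the window is reported as None.
    """
    mu = mean(baseline_rates)
    sd = stdev(baseline_rates)      # sample s.d.; an assumption of this sketch
    upper = mu + n_sd * sd
    lower = mu - n_sd * sd
    max_bins = window_ms // bin_ms  # 10 bins for a 500-ms window

    first_up = None
    first_down = None
    for index, rate in enumerate(post_rates[:max_bins]):
        if first_up is None and rate > upper:
            first_up = index
        if first_down is None and rate < lower:
            first_down = index
    return first_up, first_down


# Hypothetical unit: baseline mean 10 Hz, biphasic response after light onset.
example = bin_of_first_change([10, 12, 8, 10, 11, 9], [10, 11, 14, 9, 5, 10])
print(example)  # (2, 4)
```

Under this reading, a unit returning `None` in both directions would be excluded, while a biphasic unit contributes one bin to each direction.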
Extended Data Figure 7 | ChR2-mediated stimulation of ChAT^i-Cre axons
following rAAV transduction or with a Cre-activated allele evokes ACh- and
GABA-mediated currents. a, b, Targeting ChR2 expression to ChAT⁺ and
ChAT⁻ GP-FC cells. a, Schematic of ChAT⁺ (magenta) or ChAT⁻ (black)
GP-FC axons expressing ChR2-mCherry after DIO (Cre-On) or DO (Cre-Off)
rAAV transduction in the GP of ChAT^i-Cre;GAD1^GFP mice. b, rAAV
DO-ChR2-mCherry transduced into the ChAT^i-Cre GP expresses ChR2-mCherry
selectively in Cre⁻ neurons. Single confocal plane showing neighbouring
ChR2-mCherry⁺ soma (dotted outline) and ChAT⁺ soma at the GP-nucleus
basalis border. Of 158 ChR2-mCherry⁺ neurons surrounding 223 ChAT⁺
neurons, 0 were ChR2-mCherry⁺ChAT⁺ (from two mice). c, ChAT⁺ axons
surrounding GAD1^GFP-expressing cells in FC layer 6. d-f, ChAT⁺ GP-FC cells
ramify local axon collaterals around the GP-nucleus basalis border. d, Sagittal
atlas with the GP-nucleus basalis border indicated with a dashed box. e, Left,
low-magnification view of ChAT^i-Cre GP following transduction with rAAV
DIO-synaptophysin-mCherry. DAPI (grey), nuclear stain. Right, maximum
projection confocal stack (28 µm) of inset region. Example putative presynaptic
boutons indicated by arrowheads. f, Left, low-magnification sagittal section
from ChAT^i-Cre;Rosa26^lsl-ChR2-eYFP/+ mouse. Right, high-magnification inset of
GP showing distribution of neurons (NeuN immunostain, magenta) and
ChR2-eYFP⁺ processes (white). g, h, Following rAAV transduction in
ChAT^i-Cre mice, ChR2 activation of local ChAT⁺ axon collaterals results in
rare nicotinic EPSCs but prevalent GABAergic IPSCs onto ChR2⁻ GP/nucleus
basalis neurons (EPSC = 2 of 85 cells; IPSC = 7 of 85 cells from six mice).
g, Light-evoked EPSC from an example ChR2⁻ GP/nucleus basalis cell
voltage-clamped at −70 mV (top) was insensitive to glutamate receptor block with
NBQX and CPP (middle), but abolished by bath application of MEC, MLA &
DHβE (bottom), suggesting the EPSC resulted from ACh release and
nicotinic receptor activation. h, Summary of peaks from nicotinic EPSCs and
GABAergic IPSCs evoked from ChAT^i-Cre axons onto ChR2⁻ GP/nucleus
basalis cells. i, Left, low-magnification image of sagittal section from a
ChAT^i-Cre;Rosa26^lsl-ChR2-eYFP/+ mouse. Right, high-magnification of inset from
frontal cortex showing distribution of neurons (NeuN immunostain, magenta)
and ChR2-eYFP (white), expressed in axons from basal forebrain and in
local cortical interneurons. j, Maximum intensity two-photon projection of a
layer 1 interneuron following whole-cell recording. k, Left, light-evoked current
responses from two layer 1 interneurons held at indicated potentials to
optogenetic activation in a ChAT^i-Cre;Rosa26^lsl-ChR2-eYFP/+ mouse in baseline
conditions (black, NBQX & CPP) and after bath application of nicotinic
receptor antagonist cocktail (red, MEC, MLA & DHβE). Right, nicotinic EPSCs
are blocked by bath application of the non-selective blocker MEC alone (green).
l, Time until full block of light-evoked nicotinic EPSCs following bath
application of either nicotinic receptor antagonist cocktail (MEC, MLA &
DHβE, n = 5 cells from three mice) or MEC alone (n = 5 cells from three mice).
Inter-stimulus interval = 20 s.
Extended Data Figure 8 | Synaptic connectivity and array tomography
marker co-localization analysis of GP-FC axons in FC. a-d, Ionotropic
synaptic connectivity of ChAT⁺ and ChAT⁻ GP-FC neurons onto FC cell
types and layers. a, Example morphologies of FC neurons identified as
pyramidal (from layer 5) or an interneuron (from layer 1). b, Summary of
cortical neurons with ChR2-evoked monosynaptic ionotropic GABAergic or
nicotinic currents from ChAT⁺ or ChAT⁻ axons by cortical layer. c, Peak
currents induced by ChR2 activation of ChAT⁺ or ChAT⁻ GP-FC cells in FC.
Postsynaptic cells are grouped across layers as pyramidal or interneurons.
Left, GABAergic IPSCs reported with either TTX/4AP in the bath or following
wash-in are plotted with dotted data. IPSCs recorded in baseline conditions
(ChAT⁻, NBQX & CPP; ChAT⁺, ACSF only) are plotted with undotted data.
Each cell is represented once (ChAT⁻: n = 5 pyramidal, n = 15 interneurons
from 13 mice; ChAT⁺: n = 3 interneurons from 15 mice). Right, nicotinic
EPSCs recorded in ACSF, present after bath application of CPP and NBQX and
fully blocked by nicotinic receptor antagonists (MEC, MLA, DHβE, n = 5
interneurons from 15 mice). d, Onset latencies for IPSCs and EPSCs induced by
ChR2 activation of ChAT⁺ or ChAT⁻ GP-FC cells under baseline conditions
only (ACSF) or in the presence of TTX/4AP. ‘Baseline_TTX/4AP’ refers to the
subset of cells with IPSCs recorded under both baseline conditions and
recovered following wash-in of TTX/4AP (n = 5; same data as Fig. 3b). Onset
latencies of ChAT⁻ IPSCs were longer in TTX/4AP (n = 14) than ACSF
(n = 11). *P < 0.05 (Mann-Whitney); error bars denote mean ± s.e.m.
e-k, Array-tomography-based co-localization analysis of ChAT⁺ presynaptic
terminals (PSTs) in FC. e, Left, 300 µm sagittal slab from a ChAT^i-Cre mouse
injected with 300 nl of rAAV DIO-synaptophysin-GFP in GP. Right, box
indicates area of FC prepared for array tomography. f, Automated detection
of GFP⁺ volumes (pearls) and synaptic markers. Left, maximum projection
of GFP⁺ axons following computational detection of string-associated
pearls (in red). Right, a single 70 nm plane showing diffraction-limited
immunohistochemical punctae for PSD-95 and computational detection of
point sources (in red). g, Maximum projection (z = 2.17 µm) through layers
1-3 of FC following injection of rAAV DIO-synaptophysin-GFP into the GP of
a ChAT^i-Cre mouse. Inset shows location of axon shown in Fig. 3c. h-k, GFP⁺
pearls are putative GABAergic PSTs. h, Individual volumes for all detected
GFP⁺ PSTs (n = 6,071 pearls from two mice; n = 4 layers 1 & 2/3 stacks, n = 4
layer 5 stacks). i, Mean density by distance plots for GFP⁺ PSTs versus synapsin
I, bassoon, GAD 1/2, VAChT, VGAT, gephyrin, PSD-95 and parvalbumin
from example stack. Crosses indicate means from real data, while lines denote
mean values following 1,000 rounds of marker randomization. Error bars
denote 99% confidence intervals. j, Mean densities within GFP⁺ PSTs (0
distance) for all markers and all stacks. Real and randomized data are indicated
as in i. k, z score summary (n = 8 stacks) quantifying the differences in
mean density for the synaptic markers shown in j (and not reported in Fig. 3e)
within GFP⁺ PSTs (0 distance) for the real data and following ten rounds
of randomization of GFP⁺ volumes. Positive z scores indicate higher densities
in the real data. *P < 0.001 for all stacks.
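
The comparison in k between real and randomized marker densities can be illustrated with a short Python sketch. The legend does not state the formula used, so the definition below (real value minus the mean of the randomized values, divided by their sample standard deviation) is an assumption, as are the function name `randomization_z_score` and the example numbers.

```python
from statistics import mean, stdev


def randomization_z_score(real_density, randomized_densities):
    """z score of an observed density against a randomization distribution.

    real_density: mean marker density measured within the real volumes.
    randomized_densities: the same measure after each randomization round.
    A positive value means the density is higher in the real data.
    """
    mu = mean(randomized_densities)
    sd = stdev(randomized_densities)  # sample s.d.; an assumption of this sketch
    if sd == 0:
        raise ValueError("randomized densities have zero variance")
    return (real_density - mu) / sd


# Hypothetical densities from three randomization rounds.
print(randomization_z_score(5.0, [1.0, 2.0, 3.0]))  # 3.0
```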
Extended Data Figure 9 | GP-FC cells receive glutamatergic synapses from
the STN. a, Left, low-magnification view of parasagittal slice showing the
GP following biocytin labelling of the STN and avidin-HRP/DAB visualization
of STN projections. Right, high-magnification view of inset showing
DAB-labelled projections in the GP and around the GP-nucleus basalis border.
b, Schematic of experimental strategy to electrically stimulate STN projections
to GP. A bipolar electrode was placed at the anterior border of STN and GP-FC
cells were targeted for whole-cell voltage-clamp recording. c, Acute parasagittal
slice showing location of the bipolar electrode and recording pipette (red
asterisk). d, e, Electrically evoked glutamatergic currents in GP-FC cells
following stimulation of STN-GP axon tract. d, Left, example
NBQX-subtracted AMPAR (V_hold = −70 mV) then CPP-subtracted NMDAR
(V_hold = +40 mV) currents evoked in GP-FC cells under baseline conditions
(SR95531, scopolamine, CGP55845). Right, summary of AMPAR and
NMDAR peak currents in ChAT⁺ (n = 6) and ChAT⁻ (n = 4) GP-FC cells
(from seven mice). e, Onset latencies of glutamatergic currents (ChAT⁺:
2.3 ± 0.3 ms, n = 6 cells; ChAT⁻: 3.08 ± 0.3 ms, n = 4 cells, from seven mice).
Bars denote mean ± s.e.m.
Extended Data Figure 10 | GP-FC cells receive GABAergic synapses from
dorsal striatal iSPNs and dSPNs with different presynaptic properties.
a, Left, sagittal sections from an Adora2a-Cre;Rosa26^lsl-ChR2-eYFP/+ (top) or
Drd1a-Cre;Rosa26^lsl-ChR2-eYFP/+ (bottom) mouse where ChR2-eYFP is
expressed in either iSPNs or dSPNs, respectively. Right, light was delivered first
over the recorded cell (peak IPSCs in b) and then in dorsal striatum
(pharmacological analysis in Fig. 4c and Extended Data Fig. 10h and
presynaptic properties in f, g). c, d, SPNs from dorsal striatum arborize axons in
and around the GP-nucleus basalis border but not in the basal forebrain. c, Left,
sagittal section from a Vgat^i-Cre mouse injected with rAAV DIO-mCherry
into dorsal striatum. Right, higher-magnification view of inset. Axons from
SPNs arborize in the GP proper and GP-nucleus basalis border regions (areas 1
and 2) but not in the more ventral region of the basal forebrain (area 3,
nucleus basalis proper/substantia innominata). d, Left, example maximum
projection confocal stacks (z = 28-42 µm) from the inset regions in c. Right,
binary axonal quantification from the regions indicated. SPN axon density is
sharply reduced in nucleus basalis proper/substantia innominata. e, Synaptic
connectivity screen for dorsal striatal SPN IPSCs onto ChAT⁺ neurons of
the GP and basal forebrain. ChAT-GFP mice were injected with Cre-Off rAAV
DO-ChR2-mCherry in dorsal striatum and whole-cell recordings were targeted
to ChAT⁺ neurons (n = 23 cells, from four mice) in combination with
optogenetic activation. NBQX and CPP were included in the bath to block
glutamatergic transmission. Left, sagittal map of recording locations of
ChAT⁺ GFP⁺ neurons with detected IPSCs (blue) and no detected IPSCs (red).
Right, peaks and onset latencies for detected IPSCs. Green lines denote means.
In every experiment, IPSCs onto ChAT⁺ GFP⁺ neurons of the GP were
detected before recordings were targeted to more ventral areas. f, g, Paired pulse
optogenetic activation of iSPNs (Adora2a-Cre;Rosa26^lsl-ChR2-eYFP/+) and
dSPNs (Drd1a-Cre;Rosa26^lsl-ChR2-eYFP/+) in dorsal striatum reveals differences
in short-term synaptic plasticity properties in GP-FC cells. f, Examples of
optogenetically evoked paired pulse IPSCs (interstimulus interval = 20, 50,
100, 200 and 500 ms) from iSPNs (left) and dSPNs (right) in GP-FC cells.
g, Mean 2nd/1st peak IPSC ratios (iSPN: n = 13 ChAT⁺, n = 9 ChAT⁻ cells
from 11 mice; dSPN: n = 9 ChAT⁺, n = 8 ChAT⁻ cells from six mice).
*P < 0.05 iSPNs versus dSPNs (ChAT⁺ and ChAT⁻ grouped together,
Mann-Whitney); error bars denote s.e.m. h, Example iSPN peak IPSCs in a
ChAT GP-FC cell following application of quinpirole, sulpiride and SR95531.
Inset, average IPSCs. Scale bar, 20 pA/10 ms. Rs, series resistance.
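
The 2nd/1st peak IPSC ratio summarized in g can be computed as in the following Python sketch. It is only an illustration: the function names `paired_pulse_ratio` and `mean_paired_pulse_ratio` and the example amplitudes are hypothetical, and the sketch assumes that the ratio is taken per cell before averaging, which the legend does not state explicitly.

```python
from statistics import mean


def paired_pulse_ratio(first_peak, second_peak):
    """Ratio of the second to the first peak IPSC amplitude for one cell."""
    if first_peak == 0:
        raise ValueError("first peak amplitude must be non-zero")
    return second_peak / first_peak


def mean_paired_pulse_ratio(peak_pairs):
    """Mean 2nd/1st ratio across cells at one interstimulus interval.

    peak_pairs: iterable of (first_peak, second_peak) amplitudes in pA.
    """
    return mean(paired_pulse_ratio(first, second) for first, second in peak_pairs)


# Hypothetical amplitudes (pA) for two cells at a single interval.
print(mean_paired_pulse_ratio([(100.0, 50.0), (200.0, 150.0)]))  # 0.625
```

A ratio below 1 indicates depression of the second response and a ratio above 1 indicates facilitation.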

Vertically transmitted faecal IgA levels determine
extra-chromosomal phenotypic variation

Clara Moon*, Megan T. Baldridge*, Meghan A. Wallace, Carey-Ann D. Burnham, Herbert W. Virgin
& Thaddeus S. Stappenbeck


The proliferation of genetically modified mouse models has exposed 
phenotypic variation between investigators and institutions that has 
been challenging to control’*. In many cases, the microbiota is the 
presumed cause of the variation. Current solutions to account for 
phenotypic variability include littermate and maternal controls or 
defined microbial consortia in gnotobiotic mice’. In conventionally 
raised mice, the microbiome is transmitted from the dam”**. Here we 
show that microbially driven dichotomous faecal immunoglobulin- 
A (IgA) levels in wild-type mice within the same facility mimic the 
effects of chromosomal mutations. We observe in multiple facilities 
that vertically transmissible bacteria in IgA-low mice dominantly lower 
faecal IgA levels in IgA-high mice after co-housing or faecal trans- 
plantation. In response to injury, IgA-low mice show increased damage 
that is transferable by faecal transplantation and driven by faecal IgA 
differences. We find that bacteria from IgA-low mice degrade the 
secretory component of secretory IgA as well as IgA itself. These data 
indicate that phenotypic comparisons between mice must take into 
account the non-chromosomal hereditary variation between differ- 
ent breeders. We propose faecal IgA as one marker of microbial var- 
iability and conclude that co-housing and/or faecal transplantation 
enables analysis of progeny from different dams. 

We chose to study the role of secretory immunoglobulin A (SIgA), 
which is a critical intersection between the host immune system and 
the microbiota’®. While interrogating baseline intestinal IgA levels in 
wild-type (WT) C57BL/6J (B6) mice, we observed a binary phenotype
in faecal IgA levels between cages (Fig. 1a): those with high faecal IgA
(defined as 0.05-0.25 µg IgA per milligram of faeces) and those with nearly
undetectable faecal IgA, hereafter designated as IgA-high and IgA-low 
mice, respectively. We observed this differential IgA phenotype in two 
separate facilities at our institution in independently derived WT B6 
colonies (Extended Data Fig. 1a). Although both facilities are specific 
pathogen-free, the protocols, access, and personnel are distinct. All exper- 
iments were performed in both facilities unless otherwise noted. Despite 
the profound difference in faecal IgA, serum IgA levels were similar be- 
tween these two groups, suggesting a gut-specific effect (Fig. 1b). The 
binary phenotype was passed from breeders to progeny, indicating a ver- 
tically transmissible phenotype (Fig. 1c). Furthermore, this phenotype 
was laterally transferable by co-housing IgA-high and IgA-low mice. 
Remarkably, both IgA-high and IgA-low mice were found to be IgA- 
low after co-housing (Fig. 1d). This result also occurred by cross-trans- 
fer experiments involving faecal transplantation between mice in our 
two facilities (Extended Data Fig. 1b, c). Hence, the IgA-low phenotype 
was dominant, indicating that faecal IgA levels can be regulated by sup- 
pression and not only induction. 

We next passaged microbes through polymeric Ig receptor mutant
(pIgR−/−) mice that lacked the ability to transport IgA into the lumen".
This experiment allowed us to test whether the stability of the faecal
microbiome creating this binary phenotype requires the presence of faecal
SIgA (Extended Data Fig. 2a). Faecal samples from pIgR−/− mice
transplanted with IgA-low material conferred the IgA-low phenotype
to IgA-high WT mice (Extended Data Fig. 2b). Thus, exposure to a novel
environment lacking SIgA (the pIgR−/− intestine) did not affect the
ability of the faecal microbiota to regulate the IgA-high versus IgA-low
phenotype.

Because commensal bacteria and viruses modulate mucosal IgA’””’, 
we transplanted IgA-high mice with IgA-low faecal material filtered to 
remove large microbes (for example, bacteria, fungi). Mice transplanted 
with filtrate remained IgA-high, while mice transplanted with unfiltered 
material became IgA-low (Fig. 1e), implicating intestinal microbes and
excluding filterable viruses. 

To determine whether specific microbial pools could induce the 
IgA-low phenotype, we pre-treated IgA-low mice with a broad-spectrum 
antibiotic cocktail (vancomycin, neomycin, ampicillin, and metroni- 
dazole; VNAM), then performed faecal transplant from IgA-high or 
IgA-low mice (Extended Data Fig. 2c). Transplantation with IgA-high 
microbes increased faecal IgA, indicating that VNAM eliminated IgA- 
low-associated microbes (Fig. 1f). We found that ampicillin but not met- 
ronidazole was sufficient to reverse the IgA-low phenotype, indicating 
ampicillin-sensitive microbe(s) were responsible for the IgA-low phe- 
notype (Fig. 1g and Extended Data Fig. 2d). Unlike VNAM, ampicillin 
treatment reversed the IgA-low phenotype without transplantation, sug- 
gesting VNAM eliminated both IgA-suppressive and IgA-inductive 
microbes while ampicillin eliminated only IgA-suppressive microbes 
(Fig. 1f, g). We assessed whether the faecal IgA status of treated mice 
was vertically transmissible, and found that VNAM-treated IgA-low 
mice transplanted with IgA-high samples gave rise to IgA-high pro- 
geny (Extended Data Fig. 2e). Taken together, these results support a 
model where the IgA-low phenotype is bacterially driven, transmiss- 
ible, and dominant. 

Previous studies have shown that pIgR−/− mice are more susceptible
to dextran sodium sulphate (DSS) injury'*’*. With DSS treatment,
IgA-low mice lost significantly more weight than their IgA-high counterparts
(Fig. 2a) and exhibited increased distal colon ulceration (Fig. 2b, c).
This DSS sensitivity could be secondary to diminished SIgA or altered 
microbial composition. 

To address these possibilities, we re-colonized WT and pIgR−/−
mice with IgA-high or IgA-low faecal material after VNAM treatment
(Extended Data Fig. 2a) and before DSS treatment (Extended Data Fig. 3a).
As expected, WT + IgA-low mice showed enhanced DSS sensitivity
compared with WT + IgA-high mice (Fig. 2d, e and Extended Data
Fig. 3b). We observed increased weight loss and colonic ulceration in
pIgR−/− mice compared with WT mice, a finding consistent with previous
reports'*"*. Interestingly, this sensitivity was independent of IgA-low
microbes, as pIgR−/− mice colonized with IgA-high or IgA-low
microbes had no significant differences in weight loss or ulceration
(Fig. 2d, e and Extended Data Fig. 3b). This finding implied that altered
SIgA levels, and not the microbes themselves, caused increased DSS
damage in WT + IgA-low mice.

To study the mechanism by which IgA-low microbes suppress fae- 
cal IgA, we assessed IgA production and transport capacity by pIgR in 
*These authors contributed equally to this work.


90 | NATURE | VOL 521 | 7 MAY 2015 


©2015 Macmillan Publishers Limited. All rights reserved 


[Figure 1, panels a-g: plots not recoverable from the extracted text. Surviving panel labels: Breeder, Progeny (c); Initial phenotype, After co-housing (d).]


Figure 1 | Low faecal IgA in WT mice is a vertically and horizontally
transferable, dominant phenotype driven by ampicillin-sensitive bacteria.
a, b, Faecal (a) and serum IgA (b) by enzyme-linked immunosorbent assay
(ELISA). Mann-Whitney U-test: a, P < 0.0001, n = 40 (IgA-high), n = 34
(IgA-low) mice; b, P = 0.4704, n = 12 per group. c, Faecal IgA from IgA-high
(n = 10) and IgA-low (n = 11) breeders and their adult progeny (IgA-high
(n = 12), IgA-low (n = 9)). d, Faecal IgA pre- (IgA-high (n = 9), IgA-low
(n = 11)) and after co-housing (IgA-high (n = 8), IgA-low (n = 10)). One-way
analysis of variance (ANOVA): c, F = 45.95, P < 0.0001; d, F = 15.56,
P < 0.0001. e, Faecal IgA from IgA-high mice pre- (n = 18) and post-faecal
transplant (FT) with unfiltered (post-FT, n = 13) or 0.45-µm-filtered faecal
material (post-filter FT, n = 5) from IgA-low mice. f, g, Faecal IgA of post-FT
mice pre-treated with (f) VNAM or (g) ampicillin (Amp). One-way ANOVA:
e, F = 5.685, P = 0.0076; f, F = 16.15, P < 0.0001, n = 16 (pre-VNAM), n = 18
(post-VNAM), n = 9 (post-IgA-high FT), n = 8 (post-IgA-low FT);
g, F = 22.96, P < 0.0001, n = 22 (pre-Amp), n = 28 (post-Amp), n = 12
(post-IgA-high FT), n = 12 (post-IgA-low FT). All values are mean ± s.e.m. Different
footnote symbols indicate groups significantly different by Tukey’s multiple
comparison test (P < 0.05). Dashed lines, limit of detection.


IgA-high versus IgA-low mice. We found no difference in lamina propria
plasma cell numbers between these mice (Extended Data Fig. 4a-f),
nor in immunofluorescence staining for pIgR (Extended Data Fig. 4g-j).

On the apical surface of intestinal epithelial cells, pIgR is cleaved and 
the extracellular portion is released into the lumen bound to its ligand 
(multimeric immunoglobulins containing the J chain, such as dimeric 
IgA)'®. This cleaved form of pIgR, called bound secretory component 
when complexed in SIgA, helps protect dimeric IgA from degradation 
by bacterial proteases’”’*. We hypothesized that low SIgA levels were 
secondary to secretory component degradation. By immunoblotting for 
pIgR/secretory component in faecal samples and whole tissue, we found
that IgA-high and IgA-low mice had comparable tissue pIgR levels 
Figure 2 | The IgA-low phenotype alters susceptibility to DSS in an
IgA-dependent manner. a-c, DSS treatment of IgA-high/IgA-low mice:
a, percentage initial weight; b, percentage of ulcerated distal colon;
c, representative haematoxylin and eosin-stained histological sections of
IgA-high (n = 10) and IgA-low (n = 11) mice. d, e, DSS treatment of WT and
pIgR−/− mice after VNAM treatment +IgA-high/IgA-low faecal transplant:
d, percentage initial weight; e, percentage of ulcerated distal colon. Two-way
repeated measures ANOVA: a, column factor P = 0.0285; P < 0.001, Sidak’s
multiple comparisons test, final time point, IgA-high (n = 10) and IgA-low
(n = 11) mice; d, column factor P < 0.0001, n = 15 (WT + IgA-high), n = 18
(WT + IgA-low), n = 20 (pIgR−/− + IgA-high), n = 21 (pIgR−/− + IgA-low);
Tukey’s multiple comparison test, final time point. Unpaired t-test:
b, P = 0.0385, IgA-high (n = 10), IgA-low (n = 11) mice. One-way ANOVA:
e, F = 8.272, P = 0.0007, n = 3 (WT + IgA-high), n = 6 (WT + IgA-low),
n = 8 (pIgR−/− + IgA-high), n = 10 (pIgR−/− + IgA-low). All values are
mean ± s.e.m. Different footnote symbols indicate groups significantly
different by Tukey’s multiple comparison test (P < 0.05). Scale bars, 1 mm.
Boxes, ulcerated areas.


(Fig. 3a-c), but IgA-low mice had reduced faecal secretory component
(Fig. 3a, d). To identify potential microbe(s) responsible for enhanced
secretory component degradation in IgA-low mice, we performed 16S
rDNA sequencing of IgA-high and IgA-low faecal samples. Comparison
of samples within individual facilities revealed taxonomic biomarkers
associated with faecal IgA levels (Extended Data Fig. 5a, b and
Extended Data Table 1). The only genus-level IgA-low biomarker common
to both facilities was the Gram-negative faecal anaerobe Sutterella’”
(Extended Data Figs 5a and 6 and Extended Data Table 1). We grew
anaerobic cultures with faecal inoculum from IgA-high and IgA-low
mice, which enriched for Sutterella in IgA-low-derived samples (Fig. 3e
and Extended Data Table 2). When administered to IgA-high recipients,
IgA-high cultures maintained the IgA-high phenotype, while
IgA-low cultures converted mice to the IgA-low phenotype, indicating
Figure 3 | The IgA-low phenotype correlates with an absence of faecal
secretory component and the presence of specific microbial taxa in vivo,
which can be anaerobically cultured in vitro. a, Representative
anti-pIgR/secretory component (SC) immunoblots (of n = 3 replicates)
of intestinal tissue/faecal samples from IgA-high/IgA-low mice.
b-d, Quantification of immunoblots for (b) ileal, (c) colonic, and
(d) faecal pIgR/secretory component. Mann-Whitney U-test:
b, P = 0.7424, n = 7 (IgA-high), n = 5 (IgA-low) mice; c, P = 0.1285,
n = 10 (IgA-high), n = 15 (IgA-low) mice; d, P = 0.0025, n = 7 (IgA-high),
n = 5 (IgA-low) mice. e, Relative abundance of order/family/genus
16S rDNA sequence assignments of culture inoculate from IgA-high/
IgA-low samples (Pre-) or overnight cultures of these inoculates (Post-);
n = 4 samples per group. Statistical analysis: Extended Data Table 2.
f, Schematic of culturing/administration experiments. g, Faecal IgA
measured in IgA-high mice pre- (n = 18) and post-cultured microbe
administration (+IgA-high culture (n = 10), +IgA-low culture (n = 9))
depicted in f. One-way ANOVA: F = 18.60, P < 0.0001. All values are
mean ± s.e.m. Different footnote symbols indicate groups significantly
different by Tukey’s multiple comparison test (P < 0.05). Dashed lines,
limit of detection.

causative IgA-low microbes were culturable in these conditions (Fig. 3f, g).
From these results, we concluded the IgA-low-inducing microbes might
include Sutterella species.

We next employed cultured microbes to explore the mechanism of
secretory component degradation. We generated polarized, differentiated
monolayers of primary intestinal epithelial cells in Transwells”®
and assessed dimeric IgA transport from the basolateral to the apical
compartment. Treatment with γ-secretase inhibitor DAPT
(N-(N-(3,5-difluorophenacetyl)-L-alanyl)-S-phenylglycine t-butyl ester) and
lipopolysaccharide (LPS) robustly induced pIgR in these cells (Extended
Data Fig. 7a), which is necessary for IgA transport across the epithelium.
After addition of IgA to the basal compartment, we detected secretory
component in the apical supernatant of DAPT + LPS-treated cells
(Fig. 4a-c). We co-cultured monolayers with either pelleted bacteria or


Figure 4 | Culturable anaerobic bacteria present 
culture supernatants from IgA-high or IgA-low cultures in the apical
compartment (Fig. 4a-c). In co-cultures with IgA-high pelleted bacteria,
we detected secretory component in apical supernatants at three
and six hours with evidence of limited degradation, while co-cultures
with IgA-low pelleted bacteria exhibited substantially greater secretory
component degradation at 6 h. This secretory component degradation
did not depend on the presence of IgA as free secretory component was
also degraded by IgA-low microbes (Extended Data Fig. 7b-d). Incubation
of monolayers with IgA-high/IgA-low culture supernatants did
not cause secretory component degradation (Fig. 4a-c). Differences in
secretory component levels were not due to differential epithelial cell
pIgR expression (Extended Data Fig. 7a). Freeze/thaw of IgA-low cultures
also led to secretory component degradation, indicating that live
bacteria were not necessary (Extended Data Fig. 7e-g). Taken together,
these data suggest bacteria in the IgA-low microbiota degrade free and
bound secretory component in vitro (Fig. 4a-c), consistent with our
observation of absent faecal secretory component in vivo (Fig. 3a, d).
Addition of a broad-spectrum protease inhibitor cocktail partially prevented
secretory component degradation, implicating proteases in this
process (Fig. 4d-f). These findings are consistent with a model in which
degradation of bound secretory component of SIgA by IgA-low microbes
makes the IgA portion more susceptible to proteolysis. Bacteria
make proteases that can cleave human IgA1/IgA2 and secretory component,
although to our knowledge this has not been addressed in
murine models*’”. In addition to secretory component degradation,
we found that faecal IgA was degraded and thus decreased after co-culture
with IgA-low microbes, consistent with our initial in vivo observations
(Extended Data Fig. 8a, b). In the future it will be of interest to
identify the protease(s) involved in the degradation of secretory component
and/or IgA, and to look for additional host substrates of these
proteases.

This study shows that phenotypic effects can be vertically transmitted
through the microbiome and can mimic alterations of host genes.
To distinguish host genetic from extra-chromosomal effects, mice
should ideally be bred so that comparisons can be performed between
WT and mutated mice that have equivalent microbial exposure.
Hence, breeding mice that are heterozygous for a given mutation is
critical, even for the study of extra-intestinal phenotypes. Secondary
options are faecal transplantation and co-housing. These can serve as
methodological controls for phenotypic variation dependent on
extra-chromosomal factors that may be easily transmissible between hosts.
Lastly, faecal IgA serves as a readily measurable marker that can be
compared within and between facilities or institutions to assess
phenotypic differences.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Mice. Animal protocols were approved by the Washington University Animal 
Studies Committee. All mice were maintained in one of two specific pathogen-free 
barrier facilities, with different procedures for maintaining food, water, and caging. 
In facility 2 (specialized research facility), complete cages (including food, bedding, 
isolator top, wires, and cage) were autoclaved after assembly, water was autoclaved 
and kept sterile, and a higher concentration of disinfectant was used (1:5:1 Clidox, 
Pharmacal Research Laboratories). Facility 1 (clinical sciences research building fa- 
cility) also used autoclaved cage components but cages were assembled after auto- 
claving. The food was irradiated but not autoclaved, and a lower concentration of 
disinfectant was used (1:18:1 Clidox). 

For facility 2, C57BL/6J WT mice were originally obtained from Jackson
Laboratories (stock number 000664) and maintained as a breeding colony. For facility 1,
multiple sources of C57BL/6J WT mice were used to create breeding colonies.
Within our WT mouse colonies, multiple IgA-high and IgA-low breeders were
identified and maintained as independent lines. The pIgR knockout mice
(B6.129P2-Pigr<tm1Fejo>/Mmmh) were initially obtained from the Mutant Mouse Regional
Resource Center and were backcrossed to 98.4% C57BL/6J. Male and female mice
between 2 and 6 months of age were used, and faecal samples were only collected
after mice were at least 8 weeks of age. Sample sizes used for studies reflect the
number of mice needed for three independent experiments with at least two mice
used per group in each experiment, and n reflects individual mice that were unique
biological replicates. For animal studies, mice were confirmed to be IgA-high or
IgA-low before subsequent experimental manipulation.
Mouse treatments. For co-housing studies, mice were co-housed 1:1 for 14 days.
For faecal transplantation, faecal samples were collected from mice and
re-suspended in sterile PBS to a final concentration of 200 mg ml⁻¹ by weight. Mice were
orally administered 25 µl of the faecal mixture on two consecutive days.

Antibiotic treatments included 0.5 mg ml⁻¹ vancomycin, 1 mg ml⁻¹ neomycin,
1 mg ml⁻¹ ampicillin, and 1 mg ml⁻¹ metronidazole (Sigma); VNAM indicates the
cocktail of these combined antibiotics.

For DSS experiments, 2.5% DSS (TdB Consultancy) was administered in drink- 
ing water for 11 days. Mice were weighed daily and killed at 70% of initial body 
weight if needed. Intestines were taken for histology. All mice used in DSS experi- 
ments were from facility 2. 

Preparation of faecal samples for ELISA and immunoblotting. Faecal samples
were collected from mice and re-suspended in sterile PBS to a final concentration
of 100 mg ml⁻¹ by weight. Supernatants were collected and stored at −20 °C until
needed.
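
As an illustration of the weight-based suspensions described in these Methods (100 mg ml⁻¹ for ELISA and immunoblotting, 200 mg ml⁻¹ for transplantation and culture), the PBS volume for a given pellet can be computed as below. This is a minimal sketch, not the authors' procedure: the function name and the 50 mg pellet are hypothetical, and the volume displaced by the pellet itself is assumed to be negligible.

```python
def pbs_volume_ml(faecal_mass_mg: float, target_mg_per_ml: float) -> float:
    """Return the PBS volume (ml) that gives the target weight/volume concentration.

    Assumption: the pellet's own volume is ignored, so concentration is
    simply faecal mass divided by the volume of PBS added.
    """
    if faecal_mass_mg <= 0 or target_mg_per_ml <= 0:
        raise ValueError("mass and concentration must be positive")
    return faecal_mass_mg / target_mg_per_ml


# Hypothetical 50 mg pellet, prepared at the two concentrations named in the text.
elisa_volume_ml = pbs_volume_ml(50.0, 100.0)    # suspension for ELISA/immunoblotting
culture_volume_ml = pbs_volume_ml(50.0, 200.0)  # suspension for transplantation/culture
```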

Preparation of faecal samples for bacterial culture. Faecal samples were collected
from mice and re-suspended in sterile PBS to a final concentration of 200 mg ml⁻¹
by weight. Of this mixture, 250 µl was used to inoculate anaerobic chopped meat
broth in Hungate tubes (Fisher Scientific) for overnight culturing in a 37 °C
shaking incubator. Individual culture samples reflect unique biological replicates. For
culture administration to mice, 25 µl were orally administered for two consecutive
days. The faecal suspensions as well as the overnight cultures were used in
epithelial co-culture experiments described below.

16S rDNA Illumina sequencing and analysis. Faecal pellets (individual
biological replicates) were collected into 2-ml tubes (Sarstedt) with 1-mm-diameter
zirconia/silica beads (Biospec). Phenol:Chloroform:IAA (25:24:1, pH 8.0) (Fisher)
and Buffer A (200 mM NaCl, 200 mM Tris, 20 mM EDTA) (500 µl of each) were
added to the samples, as well as 210 µl of 20% SDS, and samples were homogenized
for 1 min at maximum speed with a MiniBeadBeater24 (Biospec). Samples were
centrifuged for 5 min at maximum speed, the aqueous phase was transferred and
added to 500 µl of Phenol:Chloroform:IAA and gently mixed, and samples were
re-centrifuged. The aqueous phase was added to 500 µl of isopropanol, stored at
−80 °C for 20 min, and spun at maximum speed at 4 °C for 20 min. The resulting
pellet was then washed with 100% ethanol, and re-suspended in 50 µl of water. Of
the sample, 25 µl was then cleaned with the 96-well format DNeasy Blood & Tissue
Kit (Qiagen) according to the manufacturer’s instructions, and purified DNA
samples were re-suspended at 25 ng µl⁻¹. For DNA purification from bacterial
culture inoculate and anaerobic bacterial cultures (all individual biological
replicates), the QIAamp DNA mini kit (Qiagen) was used according to the
manufacturer’s protocol. Primer selection and polymerase chain reaction (PCR) were
performed as described previously. Briefly, each sample was amplified in
triplicate with Golay-barcoded primers specific for the V4 region (F515/R806),
and the triplicates were combined and confirmed by gel electrophoresis. PCR
solutions contained 18.8 µl RNase/DNase-free water, 2.5 µl 10× High Fidelity PCR
Buffer (Invitrogen), 0.5 µl 10 mM dNTPs, 1 µl 50 mM MgSO₄, 0.5 µl each of the
forward and reverse primers (10 µM final concentration), 0.1 µl Platinum High
Fidelity Taq (Invitrogen), and 1.0 µl genomic DNA. Reactions were held at 94 °C
for 2 min to denature the DNA, with amplification proceeding for 26 cycles at
94 °C for 15 s, 50 °C for 30 s, and 68 °C for 30 s; a final extension of 2 min at
68 °C was added to ensure complete amplification. Amplicons were pooled and
purified with 0.6× Agencourt AMPure XP beads (Beckman-Coulter) according to the
manufacturer’s instructions. The final pooled samples, along with aliquots of the
three sequencing primers, were sent to the Center for Genome Sciences (Washington
University School of Medicine) for sequencing by the 2 × 250-base-pair protocol
with the Illumina MiSeq platform.
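
The per-reaction volumes listed above can be checked and scaled with a short calculation. The following is a sketch under stated assumptions rather than the protocol actually followed: the 10% pipetting overage, the function names and the choice of which components are added per sample are hypothetical. Because the primers are barcoded, the primer volumes may in practice also need to be added per sample rather than from a shared mix.

```python
# Per-reaction volumes (µl) as listed in the Methods text.
REACTION_UL = {
    "RNase/DNase-free water": 18.8,
    "10x High Fidelity PCR Buffer": 2.5,
    "10 mM dNTPs": 0.5,
    "50 mM MgSO4": 1.0,
    "forward primer": 0.5,
    "reverse primer": 0.5,
    "Platinum High Fidelity Taq": 0.1,
    "genomic DNA": 1.0,
}


def reaction_volume_ul(recipe=REACTION_UL):
    """Total volume of one reaction, rounded to 0.1 µl."""
    return round(sum(recipe.values()), 1)


def master_mix_ul(n_samples, replicates=3, overage=0.10,
                  per_sample=("genomic DNA",), recipe=REACTION_UL):
    """Scale the shared components for n_samples amplified in `replicates`.

    Assumptions (not from the source): a fractional pipetting overage and
    the set of components added individually to each tube (`per_sample`).
    """
    n_reactions = n_samples * replicates * (1 + overage)
    return {name: round(volume * n_reactions, 1)
            for name, volume in recipe.items()
            if name not in per_sample}


total_ul = reaction_volume_ul()     # single-reaction volume
mix_for_ten = master_mix_ul(10)     # shared mix for 10 samples in triplicate
```

Passing `per_sample=("genomic DNA", "forward primer", "reverse primer")` would leave both primers out of the shared mix.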

We analysed 16S sequences with Quantitative Insights Into Microbial Ecology
(QIIME, version 1.8.0). Raw sequence fastq files were quality filtered and
demultiplexed with the following criteria: the maximum number of consecutive
low-quality base calls allowed was 3, the minimum number of consecutive high-quality
base calls was greater than 75% of the input sequence length, the PHRED quality
threshold was set to 19, and reverse-complement mapping barcodes were used.
Closed-reference operational taxonomic units sharing 97% identity were clustered
with the UCLUST algorithm and assigned taxonomy according to the Greengenes
database (version 13.8). Faecal samples were rarefied to 1,000 sequences for
subsequent analyses and culture/culture inoculate samples were rarefied to 5,000
sequences. For culture/culture inoculate samples, a minimum relative abundance
of 0.005 in at least one sample was set to filter out very rare operational taxonomic
units before subsequent analysis.
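
The rarefaction and rare-taxon filtering steps can be illustrated generically. The sketch below is not QIIME's implementation: the function names and the toy count tables are hypothetical, subsampling is assumed to be without replacement, and the 0.005 cut-off is interpreted here as a fraction of reads within a sample.

```python
import random


def rarefy(counts, depth, seed=0):
    """Subsample an OTU count table (dict: OTU -> reads) to `depth` reads.

    Sampling is without replacement. Returns None when the sample has
    fewer than `depth` reads, mirroring the usual practice of dropping
    under-sequenced samples.
    """
    if sum(counts.values()) < depth:
        return None
    pool = [otu for otu, n in counts.items() for _ in range(n)]
    picked = random.Random(seed).sample(pool, depth)
    rarefied = {}
    for otu in picked:
        rarefied[otu] = rarefied.get(otu, 0) + 1
    return rarefied


def filter_rare_otus(tables, min_rel_abundance=0.005):
    """Keep OTUs that reach `min_rel_abundance` in at least one sample."""
    keep = set()
    for table in tables:
        total = sum(table.values())
        for otu, n in table.items():
            if total and n / total >= min_rel_abundance:
                keep.add(otu)
    return [{otu: n for otu, n in table.items() if otu in keep}
            for table in tables]


# Toy example: one hypothetical culture sample rarefied to 5,000 reads.
sample = {"Sutterella": 6000, "Bacteroides": 3990, "Proteus": 10}
rarefied = rarefy(sample, depth=5000)
```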

Relative operational taxonomic unit abundance data were input into LEfSe to
determine biomarkers with significant linear discriminant analysis effect size.
Biomarkers for facility 1 and for facility 2 alone were identified by comparison of
samples within each facility. Biomarkers for facilities 1 and 2 were identified by
comparison of samples from both facilities. After Kruskal-Wallis analysis (with an
α value of 0.05) of all features, a linear discriminant analysis model was used to rank
discriminant features by the effect size with which they differentiated classes. The
threshold for logarithmic linear discriminant analysis score for discriminative
features was set at 2.0. Biomarkers were graphically annotated on a taxonomic tree
with GraPhlAn (publicly available at http://huttenhower.sph.harvard.edu/graphlan).
Primary intestinal epithelial cell culture. Primary colonic epithelial stem cells
were isolated, grown, and maintained as three-dimensional spheroid cultures in
Matrigel (BD Biosciences) as described previously. Cells were kept in 50%
L-WRN conditioned media. Media were changed every 2 days, and cells were
passaged every 3 days (1:3 split). Primary intestinal epithelial cell monolayers were
formed as described previously. Briefly, spheroids were recovered from
3-day-old three-dimensional Matrigel cultures, trypsinized, dissociated to single cells by
vigorous pipetting, and re-suspended in 50% L-WRN conditioned media
containing 10 µM Y-27632 (R&D Systems). These cells were plated in Transwell inserts
(Corning Costar) coated with 1 mg ml⁻¹ gelatin. Each individual experiment using
the colonic epithelial stem cells reflects unique biological replicates.

Cell treatments. On day one (24 h after seeding the Transwells) the 50% L-WRN
conditioned media supplemented with Y-27632 was removed and replaced with
0% conditioned media (Advanced DMEM/F12 containing 20% fetal bovine serum
(FBS), 100 units of penicillin, 0.1 mg ml⁻¹ streptomycin, and 2 mM L-glutamine).
At this time, any additional treatments were also administered to the cells: 1 µg ml⁻¹
LPS (Sigma) and 10 µM DAPT γ-secretase inhibitor (Millipore). Cells were given
fresh media with the respective treatments on day two, and were treated for a total
of 48 h before being used for transcytosis of dimeric IgA on day three.

IgA transcytosis assay. On day three, the Transwells were removed from the
various treatment conditions, and switched to base media (Advanced DMEM/F12
supplemented with 2 mM L-glutamine only; no FBS, no antibiotics). Six
hundred microlitres of base media containing 3 µg of mouse IgA (BD Pharmingen)
was added to the lower compartment (final concentration of IgA = 5 µg ml⁻¹). In
some experiments, 600 µl of base media alone (no IgA) was added to the lower
compartment. Base media (100 µl, with or without various treatments) was added
to the upper compartment. Treatments included 1/10 faecal bacterial suspensions
(bacterial pellet or supernatant fractions), 1/10 overnight anaerobic chopped meat
bacterial cultures (live or freeze/thawed), and 1× cOmplete Protease Inhibitor
Cocktail (Roche). The apical supernatant or cells were collected at 3 h or 6 h to
evaluate the amount of pIgR/secretory component by immunoblotting or IgA by
ELISA (Immunology Consultants Labs). Each experiment reflects unique
biological replicates.
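
The concentrations in this assay follow from simple arithmetic, sketched below. The helper names are hypothetical, and the 1/10 treatments are assumed to mean one part stock in ten parts total volume, which the text does not state explicitly.

```python
def concentration_ug_per_ml(mass_ug, volume_ul):
    """Concentration (µg/ml) of `mass_ug` dissolved in `volume_ul` microlitres."""
    return mass_ug * 1000.0 / volume_ul


def dilution_volumes_ul(total_ul, dilution_factor):
    """Split a final volume into (stock, diluent) for a 1/dilution_factor dilution.

    Assumption: "1/10" means 1 part stock in 10 parts total volume.
    """
    stock = total_ul / dilution_factor
    return stock, total_ul - stock


# Lower compartment: 3 µg IgA in 600 µl base media.
basolateral_iga = concentration_ug_per_ml(3.0, 600.0)

# Upper compartment: 100 µl total with a 1/10 bacterial suspension.
stock_ul, media_ul = dilution_volumes_ul(100.0, 10)
```

With the volumes stated in the text, the first call reproduces the reported 5 µg ml⁻¹ and the second gives 10 µl of suspension in 90 µl of base media.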

Immunostaining and histological analysis. For whole-tissue immunostaining,
mouse colons were harvested and prepared as previously described. Transverse
sections (5 µm thick) were cut for haematoxylin and eosin staining and
immunostaining. For this procedure, the sections were de-paraffinized, hydrated, boiled in
Trilogy solution (Cell Marque) for 20 min, rinsed in PBS, blocked with 10 mg ml⁻¹
bovine serum albumin/0.1% Triton-X100 for 30 min, and incubated with primary
antibody at 4 °C overnight. Primary antibodies included goat anti-mouse pIgR
(1/500, R&D Systems, catalogue number AF2800) and goat anti-mouse
IgA-Alexa Fluor 488 (1/200, Serotec, catalogue number STAR137F). The slides were
rinsed three times in PBS and incubated with Alexa Fluor 594-conjugated
species-specific secondary antibody for 1 h at room temperature (20–22 °C) (1/500,
Invitrogen, catalogue number A11058) if needed. Slides were washed three times in
PBS and stained with bis-benzimide/Hoechst (Invitrogen) to visualize nuclei and
mounted with a 1:1 PBS:glycerol solution. Staining was visualized with a Zeiss
Axiovert 200 microscope with an Axiocam MRM digital camera.
Immunoblotting. Protein was isolated from intestinal tissue segments of ~1 cm
in 500 µl RIPA buffer with protease inhibitors with the Fastprep bead-beater
system (MP Bio, BioSpec). Samples were subjected to four rounds of lysis at speed
6 for 20 s at 4 °C. Primary intestinal epithelial cells were lysed in Transwells with
50 µl RIPA buffer with protease inhibitors (Sigma). Total protein was quantified
by a Pierce BCA Protein Assay Kit (Thermo Scientific). Supernatants from faecal
samples and Transwells were taken as described above. Samples were run on
SDS-polyacrylamide gel electrophoresis gels (Any kD or 7.5% Mini-Protean TGX gels,
Bio-Rad) and transferred onto nitrocellulose membrane (Bio-Rad). Membranes
were blocked with 3% milk in 0.1% Tween-20 Tris-buffered saline for 1 h at room
temperature and probed with goat IgG anti-pIgR/secretory component (R&D,
catalogue number AF2800) and rabbit IgG anti-Actin (Sigma, catalogue number
2066) overnight at 4 °C. Blots were incubated for 1 h with
horseradish-peroxidase-conjugated secondary antibodies (Invitrogen catalogue number A16005, BioRad
catalogue number 170-6515) before development with a SuperSignal West Dura
chemiluminescent kit (Thermo Scientific). Immunoblots were quantified by ImageJ
software. Whole-tissue pIgR/secretory component was normalized to actin, faecal
samples were normalized by weight as described above, and apical supernatants
were normalized by volume.
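
Normalization of whole-tissue bands to actin amounts to a per-lane ratio of band intensities. The sketch below illustrates that step only: the function name and the intensity values are hypothetical, and it does not reproduce the ImageJ workflow itself.

```python
def normalize_to_loading_control(target, control):
    """Divide each lane's target band intensity by its loading-control band.

    `target` and `control` are equal-length lists of band intensities
    (for example, pIgR/secretory component and actin areas from ImageJ).
    """
    if len(target) != len(control):
        raise ValueError("each lane needs one target and one control value")
    if any(c <= 0 for c in control):
        raise ValueError("loading-control intensities must be positive")
    return [t / c for t, c in zip(target, control)]


# Hypothetical band areas for two lanes.
ratios = normalize_to_loading_control([200.0, 150.0], [100.0, 50.0])
```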

Statistical analysis and experimental design. No statistical methods were used to
predetermine sample size. Statistical significance between two groups was
determined by unpaired Student’s t-test if the data passed the D’Agostino-Pearson
normality test or by Mann-Whitney U-test if the data did not pass the normality
test. P value calculations were two-tailed. Comparison of more than two groups was
performed with one-way ANOVA followed by Tukey’s multiple comparisons test,
or two-way repeated measures ANOVA in Prism GraphPad software. Methods and
P values are detailed in figure legends. Additional details of 16S rDNA analysis are
included in the ‘16S rDNA Illumina sequencing and analysis’ section above. The letter n
refers to the number of mice per group unless otherwise noted. All samples reflect
unique biological replicates.
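
The choice of test described above can be summarized as a small decision rule. This sketch only names the test and performs no statistics. The function and its arguments are hypothetical, and the use of a repeated-measures design as the criterion for two-way repeated measures ANOVA is an assumption, since the text does not state how that choice was made.

```python
def choose_test(n_groups, passes_normality=True, repeated_measures=False):
    """Name the test implied by the stated rules for a given comparison.

    Assumption: two-way repeated measures ANOVA is chosen when the same
    subjects are measured repeatedly; the source does not spell this out.
    """
    if n_groups < 2:
        raise ValueError("at least two groups are required")
    if n_groups == 2:
        if passes_normality:
            return "unpaired two-tailed Student's t-test"
        return "two-tailed Mann-Whitney U-test"
    if repeated_measures:
        return "two-way repeated measures ANOVA"
    return "one-way ANOVA with Tukey's multiple comparisons test"


two_group_nonnormal = choose_test(2, passes_normality=False)
four_group = choose_test(4)
```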

Inclusion of in vitro experiments was dependent upon expected performance of 
positive and negative controls. IgA ELISAs were performed blinded by a single 
investigator. Histological and immunofluorescence observations were performed 
blinded by two independent investigators. Samples were assessed in random order 
after being assigned numbers. Animals initially co-housed were randomly distrib- 
uted to experimental groups, although no investigator blinding occurred during 
the execution of animal experiments. 
Extended Data Figure 1 | WT mice within two independent facilities exhibit
binary faecal IgA levels, and the IgA-low phenotype is transferable between
these facilities. a, Faecal IgA (normalized to faecal weight) from mice housed
in either facility 1 (n = 28 IgA-high and n = 22 IgA-low mice) or facility 2
(n = 12 mice per group) was detected by anti-mouse IgA ELISA. b, c, WT
IgA-high mice from one mouse facility were transplanted with homogenized faecal
material from WT IgA-high or IgA-low mice from the other mouse facility, and
faecal IgA was measured 14 days later by anti-mouse IgA ELISA. b, Facility 1
mice pre- (n = 18 mice) and post-faecal transplantation with facility 2 faecal
samples (n = 8 post-IgA-high and n = 10 post-IgA-low mice). c, Facility 2 mice
pre- (n = 10 mice) and post-faecal transplantation with facility 1 faecal samples
(n = 4 post-IgA-high and n = 6 post-IgA-low mice). The dotted lines represent
the limit of detection by ELISA. All values are mean ± s.e.m. One-way
ANOVA: a, F = 44.59, P < 0.0001; b, F = 20.93, P < 0.0001; c, F = 12.92,
P = 0.0004. Means with different footnote symbols are significantly different by
Tukey’s multiple comparison test (P < 0.5).
Extended Data Figure 2 | IgA-high- and IgA-low-associated microbes can
be stably passaged through pIgR−/− recipients, and are vertically
transmissible after recolonization. a, Schematic for repopulation of pIgR−/−
microbiota with WT IgA-high/IgA-low samples, followed by faecal
transplantation (FT) of pIgR−/− IgA-high or IgA-low samples to WT IgA-high
mice. b, Faecal IgA on day 44 depicted in a. Mann-Whitney U-test: P = 0.0006,
n = 8 mice per group. c, Experimental schematic of antibiotic treatment groups
and transplant protocol for d and Fig. 1f, g. d, Faecal IgA of post-FT mice on
day 30 pre-treated with metronidazole (Metro). One-way ANOVA: F = 6.525,
P = 0.0012, n = 13 (pre-Metro), n = 15 (post-Metro), n = 8 (post-IgA-high
FT), and n = 5 (post-IgA-low FT). All values are mean ± s.e.m. e, IgA-low mice
converted to IgA-high from Fig. 1f were mated, and faecal IgA of their adult
progeny was measured. One-way ANOVA: F = 18.29, P = 0.0002, n = 2
breeders, n = 10 progeny from four litters. Different footnote symbols indicate
significant differences by Tukey’s multiple comparison test (P < 0.5).
Dotted lines: limit of detection.
Extended Data Figure 3 | DSS effects on pIgR−/− mice are dependent on
IgA and not microbes. a, Faecal IgA levels were measured in WT mice
from Fig. 2d, e after VNAM treatment and IgA-high/IgA-low faecal
transplantation (FT), before the start of DSS treatment. Statistical analysis by
Mann-Whitney U-test: P = 0.0006, n = 7 mice per group. b, Representative
haematoxylin and eosin-stained histological sections of WT and pIgR−/−
mice from Fig. 2d, e after 14 day VNAM treatment + IgA-high/IgA-low
faecal transplantation. Representative of n = 3 (WT + IgA-high), n = 6
(WT + IgA-low), n = 8 (pIgR−/− + IgA-high), and n = 10 (pIgR−/− + IgA-low)
mice. All values indicated as mean ± s.e.m. Means with different footnote
symbols are significantly different by Tukey’s multiple comparison test
(P < 0.5). Dotted lines, limit of detection.
Extended Data Figure 4 | Plasma cell numbers and pIgR expression are
unchanged in the ileum and colon between IgA-high and IgA-low mice.
a–d, Ileal and colonic sections from IgA-high and IgA-low mice were stained
with anti-IgA (green) and bis-benzimide dye (blue); representative ×20
images are shown of n = 10 (a–c) or n = 9 mice (d). Scale bars, 100 µm.
e, f, Quantification of ileal plasma cells per villus (e) and colonic plasma cells per
×20 field (area = 1.5 µm × 10° µm) (f) based on IgA staining. All values are
mean ± s.e.m. Statistical analysis by Mann-Whitney U-test: e, P = 0.5191,
n = 10 mice per group; f, P = 0.3117, n = 10 IgA-high and n = 9 IgA-low mice.
g–j, Ileal and colonic sections from IgA-high and IgA-low mice were stained
with anti-pIgR/secretory component (red) and bis-benzimide dye (blue);
representative images are shown (n = 10 mice per group). Scale bars, 100 µm.
Extended Data Figure 5 | 16S rDNA sequencing identifies biomarkers for
IgA-low and IgA-high samples. a, b, LEfSe analysis of 16S rDNA sequencing
of IgA-low and IgA-high faecal samples from facilities 1 and 2 identified
statistically significant bacterial taxa biomarkers for (a) IgA-low and (b)
IgA-high samples. Biomarkers for facility 1 and biomarkers for facility 2 alone
were identified by comparison of IgA-high and IgA-low samples within each
facility. Biomarkers for facilities 1 and 2 were identified by comparison of
all IgA-high and IgA-low samples from both facilities. No IgA-high biomarkers
were identified when comparing all IgA-high and IgA-low samples from both
facilities. Biomarkers for the indicated groups are plotted as taxonomic trees
with GraPhlAn (http://huttenhower.sph.harvard.edu/graphlan); n = 13
(facility 1 IgA-high), n = 14 (facility 1 IgA-low), n = 73 (facility 2 IgA-high),
and n = 68 (facility 2 IgA-low) samples. Statistical analysis is shown in
Extended Data Table 1.
Extended Data Figure 6 | Sutterella is more abundant in IgA-low samples
than IgA-high samples in both facilities. a–c, Relative abundance of
sequences assigned by QIIME to the bacterial genus Sutterella from 16S rDNA
analysis in (a) facility 1 and (b) facility 2. These results are summarized in
c. One-way ANOVA: F = 12.85, P < 0.0001. n = 13 (facility 1 IgA-high),
n = 14 (facility 1 IgA-low), n = 73 (facility 2 IgA-high), and n = 68 (facility 2
IgA-low) samples. Values in c are indicated as mean ± s.e.m. Means with
different footnote symbols are significantly different by Tukey’s multiple
comparison test (P < 0.5).
Extended Data Figure 7 | IgA-low cultured bacteria can degrade free secretory
component in the absence of IgA, and secretory-component-degrading
properties of these bacteria are active after freeze/thaw. Primary intestinal
epithelial Transwell monolayers were pre-treated with 10 µM DAPT + 1 µg ml⁻¹
LPS on days 1 and 2 post-seeding to induce differentiation and pIgR expression.
Some wells were left untreated as negative controls. On day 3 post-seeding, either
3 µg of normal mouse dimeric IgA or media alone was added to the lower
compartment of the Transwells. Different subsets of the DAPT + LPS-treated
Transwells were also treated with one of the following conditions in the apical
compartment: IgA-high/IgA-low bacterial cultures (pelleted bacterial or
supernatant fraction), live or freeze/thawed IgA-high/IgA-low bacterial cultures
(pelleted bacterial fraction). Apical Transwell supernatants were collected at 3 h
and 6 h, and the amount of secretory component was measured by
anti-secretory-component immunoblot. a, Representative anti-pIgR/secretory component and
anti-actin immunoblots of intestinal epithelial monolayers at 6 h (one of three
experiments). b–d, Secretory component degradation in the absence of IgA.
(b) Representative anti-secretory-component immunoblot and quantification of
undegraded secretory component (denoted by the red brackets) at 3 h (c) and 6 h
(d) over four independent experiments by ImageJ. e–g, Secretory component
degradation by freeze/thawed bacterial cultures. e, Representative
anti-secretory-component immunoblot and quantification of undegraded secretory component
at 3 h (f) and 6 h (g) over five independent experiments by ImageJ. All values are
mean ± s.e.m. One-way ANOVA: c, F = 1.834, P = 0.1831, n = 4 independent
experiments with DAPT + LPS, no IgA repeated twice; d, F = 23.96, P = 0.0002,
n = 4 independent experiments with DAPT + LPS, no IgA repeated twice;
f, F = 7.444, P = 0.0045, n = 5 independent experiments with untreated and
DAPT + LPS repeated three times; g, F = 31.53, P < 0.0001, n = 5 independent
experiments with untreated and DAPT + LPS repeated three times. Means with
different footnote symbols are significantly different by Tukey’s multiple
comparison test (P < 0.5).
Extended Data Figure 8 | IgA-low cultured bacteria can degrade IgA.
Primary intestinal epithelial cell monolayers were pre-treated with 10 µM
DAPT + 1 µg ml⁻¹ LPS on days 1 and 2 post-seeding to induce differentiation
and pIgR expression. Some wells were left untreated as negative controls.
On day 3 post-seeding, 3 µg of normal mouse IgA was added to the lower
compartment of the Transwells. Different subsets of the DAPT + LPS-treated
Transwells were also treated with combinations of the following in the apical
compartment: live IgA-low bacterial cultures (either the pelleted bacterial or
supernatant fraction), freeze/thawed IgA-low bacterial cultures, and a 1×
protease inhibitor (PI) cocktail. Apical Transwell supernatants were collected at
3 h (a) and 6 h (b), and the amount of IgA was measured by anti-mouse IgA
ELISA. The dotted lines represent the limit of detection by ELISA. All values
are mean ± s.e.m. One-way ANOVA: a, F = 26.32, P < 0.0001, n = 8
(untreated), n = 8 (DAPT + LPS), n = 6 (IgA-low culture, pellet), n = 3
(IgA-low culture, supernatant), n = 4 (IgA-low culture, freeze/thawed), and
n = 4 (IgA-low culture, + PI); b, F = 35.57, P < 0.0001, n = 8 (untreated),
n = 8 (DAPT + LPS), n = 6 (IgA-low culture, pellet), n = 3 (IgA-low culture,
supernatant), n = 3 (IgA-low culture, freeze/thawed), and n = 4 (IgA-low culture, + PI).
Means with different footnote symbols are significantly different by Tukey’s
multiple comparison test (P < 0.5); ND, not detected.
Extended Data Table 1 | 16S rDNA analysis of faecal samples from two facilities identified discriminant biomarkers between IgA-high and
IgA-low samples


All Sample Comparison Facility 1 Comparison Facility 2 Comparison
LDA Effect Size LDA Effect Size LDA Effect Size
Discriminant Biomarker (log10) P-value IgA (log10) P-value IgA (log10) P-value IgA
Proteobacteria.Betaproteobacteria 3.37952 4.28E-17 Low 3.35270 0.004907 Low 3.43121 1.9E-15 Low
Proteobacteria.Betaproteobacteria.Burkholderiales.Alcaligenaceae.Sutterella 3.35757 4.28E-17 Low 3.35295 0.004907 Low 3.39543 1.9E-15 Low
Bacteroidetes.Bacteroidia.Bacteroidales.Bacteroidaceae 3.81243 0.003826 High 3.46226 7.65E-05 Low
Bacteroidetes.Bacteroidia.Bacteroidales.Bacteroidaceae.Bacteroides 3.81243 0.003826 High 3.45972 7.65E-05 Low
Bacteroidetes.Bacteroidia.Bacteroidales.Prevotellaceae.Prevotella 3.46939 0.008456 High 3.94783 6.31E-16 Low
Bacteroidetes.Bacteroidia.Bacteroidales.Rikenellaceae 3.72308 0.015021 High 3.82644 1.56E-12 Low
Bacteroidetes.Bacteroidia.Bacteroidales 3.20353 0.009979 High 3.13301 1.6E-12 Low
Firmicutes.Clostridia.Clostridiales.Christensenellaceae 3.53769 0.016585 High 3.33080 0.036190 Low
Actinobacteria.Coriobacteriia.Coriobacteriales.Coriobacteriaceae 3.22514 0.044352 Low
Bacteroidetes.Bacteroidia.Bacteroidales.Paraprevotellaceae 3.67592 9.65E-05 Low
Bacteroidetes.Bacteroidia.Bacteroidales.Paraprevotellaceae.Prevotella 3.67592 9.65E-05 Low
Bacteroidetes.Bacteroidia.Bacteroidales.Prevotellaceae 3.82596 9.65E-05 Low
Deferribacteres 3.35837 0.005758 Low
Deferribacteres.Deferribacteres 3.35874 0.005758 Low
Deferribacteres.Deferribacteres.Deferribacterales 3.36137 0.005758 Low
Deferribacteres.Deferribacteres.Deferribacterales.Deferribacteraceae 3.35772 0.005758 Low
Deferribacteres.Deferribacteres.Deferribacterales.Deferribacteraceae.Mucispirillum 3.36082 0.005758 Low
Proteobacteria.Gammaproteobacteria 3.30927 0.019810 Low
Proteobacteria.Gammaproteobacteria.Enterobacteriales 3.30713 0.019810 Low
Proteobacteria.Gammaproteobacteria.Enterobacteriales.Enterobacteriaceae 3.31707 0.019810 Low
Firmicutes.Clostridia.Clostridiales 4.57889 0.000680 High
Firmicutes.Clostridia.Clostridiales.Lachnospiraceae 3.82545 0.017366 High
Firmicutes.Clostridia.Clostridiales.Lachnospiraceae.Dorea 3.31929 0.026676 High
Firmicutes.Clostridia.Clostridiales.Ruminococcaceae.Oscillospira 3.84527 0.006447 High
Firmicutes.Clostridia.Clostridiales.Ruminococcaceae.Ruminococcus 3.77812 0.000499 High
Proteobacteria.Alphaproteobacteria.RF32 3.27961 0.008611 High
Proteobacteria.Epsilonproteobacteria.Campylobacterales.Helicobacteraceae 3.62530 0.000393 High
Tenericutes 3.27403 0.001065 High
Tenericutes.Mollicutes 3.25535 0.003120 High
Tenericutes.Mollicutes.RF39 3.25409 0.003120 High
Bacteroidetes 4.70727 0.000397 Low
Bacteroidetes.Bacteroidia 4.70727 0.000397 Low
Bacteroidetes.Bacteroidia.Bacteroidales.Odoribacteraceae 3.41723 1.22E-18 Low
Bacteroidetes.Bacteroidia.Bacteroidales.Odoribacteraceae.Odoribacter 3.47274 1.22E-18 Low
Bacteroidetes.Bacteroidia.Bacteroidales.Porphyromonadaceae 3.11464 0.000676 Low
Bacteroidetes.Bacteroidia.Bacteroidales.Porphyromonadaceae.Parabacteroides 3.06586 0.000676 Low
Bacteroidetes.Bacteroidia.Bacteroidales.S24_7 4.49336 0.034288 Low
Firmicutes.Clostridia.Clostridiales.Dehalobacteriaceae 3.44482 0.010761 Low
Firmicutes.Clostridia.Clostridiales.Dehalobacteriaceae.Dehalobacterium 3.45906 0.010761 Low
Proteobacteria 3.38346 8.98E-13 Low
TM7 3.94808 0.013973 Low
TM7.TM7_3 3.79262 0.013973 Low
TM7.TM7_3.CW040 3.60561 0.013973 Low
TM7.TM7_3.CW040.F16 3.84749 0.013973 Low
Cyanobacteria 3.19671 0.000962 High
Cyanobacteria.4C0d_2 3.38513 0.001431 High
Cyanobacteria.4C0d_2.YS2 3.31490 0.001431 High
Firmicutes 4.64404 0.003823 High
Firmicutes.Clostridia.Clostridiales.Clostridiaceae 3.92271 2.01E-05 High
Firmicutes.Clostridia.Clostridiales.Clostridiaceae.02d06 2.56784 0.024838 High
Firmicutes.Clostridia.Clostridiales.Clostridiaceae.Clostridium 2.54472 0.019193 High
Firmicutes.Clostridia.Clostridiales.Clostridiaceae.Sarcina 2.56391 0.021179 High
Firmicutes.Clostridia.Clostridiales.Peptococcaceae.rc4_4 3.26329 0.000457 High
Firmicutes.Clostridia.Clostridiales.Peptostreptococcaceae 2.74377 0.034900 High
Firmicutes.Clostridia.Clostridiales.Ruminococcaceae 3.68674 0.000262 High
Firmicutes.Erysipelotrichi.Erysipelotrichales.Erysipelotrichaceae 2.56710 0.014173 High
Tenericutes.Mollicutes.Anaeroplasmatales 3.43058 0.012537 High
Tenericutes.Mollicutes.Anaeroplasmatales.Anaeroplasmataceae 3.41372 0.012537 High
Tenericutes.Mollicutes.Anaeroplasmatales.Anaeroplasmataceae.Anaeroplasma 3.43713 0.012537 High


The relative abundance of operational taxonomic units from 16S rDNA amplification of faecal sample DNA was determined with QIIME, and these data were input into LEfSe to identify discriminant biomarkers.
After Kruskal-Wallis analysis (with an α value of 0.05) of all features, a linear discriminant analysis model was used to rank discriminant features by the effect size with which they differentiated classes, in this case
IgA-high versus IgA-low samples. We performed analysis to discriminate between IgA-high and IgA-low samples from both facilities (‘All Sample Comparison’), from facility 1 only (‘Facility 1 Comparison’) or facility
2 only (‘Facility 2 Comparison’). Biomarkers are depicted as phylum.class.order.family.genus, to the level to which the taxonomic units were assignable by QIIME. ‘IgA’ indicates whether the biomarker was enriched
in IgA-high samples (‘High’) or IgA-low samples (‘Low’).
Extended Data Table 2 | IgA-low sample cultures enrich for unique bacterial taxa compared with cultures from IgA-high samples


Columns: Bacterial taxa; mean and standard deviation for Pre IgA-high, Post IgA-high, Pre IgA-low and Post IgA-low samples; one-way analysis of variance;
Tukey's multiple comparisons test for Pre IgA-high vs Post IgA-high, Pre IgA-high vs Pre IgA-low, Pre IgA-high vs Post IgA-low,
Post IgA-high vs Pre IgA-low, Post IgA-high vs Post IgA-low, and Pre IgA-low vs Post IgA-low.
Bacteroidales. Bacteroidaceae. 0.0007 0.0001 0.0535 0.1791 F=367.9 NS yea re does ee ies 
Bacteroides +/-0.0005 +/-0.0001 +/-0.0122 +/-0.0127 P<0.0001 
Bacteroidales.Porphyromonadaceae. 0.0002 0.0001 0.0015 0.0173 F=80.90 NS NS ieee NS eve ers 
Parabacteroides +/-0.0003 +/-0.0001 +/-0.0015 +/-0.0034 P< 0.0001 
Bacteroidales.Prevotellaceae. 0 +/-0 0 +/-0 0.0657 0.0002 F=14.66 NS ee NS ie NS “ex 
Prevotella +/-0.0343 +/-0.0003 P=0.0003 
Bacteroidales.S24-7. 0.8166 0.1087 0.7920 0.0905 F=318.0 ie NS Mie sbiee NS ica 
Unclassified +/-0.0809 +/-0.0164 +/-0.0098 +/-0.0377 P< 0.0001 
Burkholderiales.Alcaligenaceae. 0.0006 0.0004 0.0241 0.4037 F=660.5 NS NS oe NS eee pone 
Sutterella +/-0.0003 +/-0.0002 +/-0.0057 +/-0.0303 P< 0.0001 
Clostridiales. Unclassified. 0.1665 0.0019 0.0264 0.0002 F=13.75 as - ae NS NS NS 
Unclassified +/-0.0850 +/-0.0014 +/-0.0109 +/-0.0002 P=0.0003 
Clostridiales.Lachnospiraceae. 0.0007 0.0012 0.0001 0.0014 F=3.717 7 
Unclassified 4/-0,0004 +/-0.0007 +/-0.0002 +/-0.0010 P=0.0423  “S id oa NS ue 
Clostridiales. Mogibacteriaceae. 0.0004 0.0753 0.0007 0.0674 F=3.021 NS NS NS NS NS NS 
Unclassified +/-0.0003 +/-0.0851 +/-0.0008 +/-0.0407 P=0.0716 
Enterobacteriales.Enterobacteriaceae. 0.0003 0.0004 0.0004 0.1760 F=531.4 NS NS jae NS eee anes 
Unclassified +/-0.0005 +/-0.0004 +/-0.0000 +/-0.0152 P<0.0001 
Enterobacteriales.Enterobacteriaceae. 0.0014 0.7870 0.0015 0.0322 F=360.4 bate NS NS ‘ue ae NS 
Proteus +/-0.0016 +/-0.0793 +/-0.0006 +/-0.0198 P<0.0001 
Lactobacillales.Lactobacillaceae. 0.0127 0.0249 0.0339 0.0319 F=1.510 
Lactobacillus +/-0.0066 +/-0.0176 +/-0.0245 +/-0.0048 P=0.2622 Ne NS Re NS NS NS 


Means and standard deviations of assigned order.family.genus operational taxonomic units are depicted. Each taxon was compared between groups by one-way ANOVA followed by Tukey’s multiple 
comparisons test (P< 0.5). 
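
The one-way ANOVA named in this legend reduces, for each taxon, to a ratio of between-group to within-group mean squares. The sketch below shows that arithmetic under stated assumptions: the function name and the replicate values used to exercise it are hypothetical (the table reports only means and standard deviations, not the raw replicates), and neither the P value nor Tukey's multiple comparisons test is computed.

```python
def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of groups of replicate values."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    # Between-group sum of squares: group size times squared distance of its mean from the grand mean.
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within-group sum of squares: squared distance of each replicate from its own group mean.
    ss_within = sum((x - sum(g) / len(g)) ** 2 for g in groups for x in g)
    df_between = k - 1
    df_within = n_total - k
    if ss_within == 0:
        raise ValueError("no within-group variance; F is undefined")
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within
```

A larger F means the group means are spread further apart than replicate-to-replicate noise would explain, which is the pattern behind the large F values reported for taxa such as Sutterella.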
Immunosuppressive plasma cells impede 
T-cell-dependent immunogenic chemotherapy 
Cancer-associated genetic alterations induce expression of tumour 
antigens that can activate CD8+ cytotoxic T cells (CTLs), but the 
microenvironment of established tumours promotes immune tolerance 
through poorly understood mechanisms. Recently 
developed therapeutics that overcome tolerogenic mechanisms 
activate tumour-directed CTLs and are effective in some human 
cancers. Immune mechanisms also affect treatment outcome, and 
certain chemotherapeutic drugs stimulate cancer-specific immune 
responses by inducing immunogenic cell death and other effector 
mechanisms. Our previous studies revealed that B cells recruited 
by the chemokine CXCL13 into prostate cancer tumours promote 
the progression of castrate-resistant prostate cancer by producing 
lymphotoxin, which activates an IκB kinase α (IKKα)-BMI1 module 
in prostate cancer stem cells. Because castrate-resistant prostate 
cancer is refractory to most therapies, we examined B cell 
involvement in the acquisition of chemotherapy resistance. Here 
we focus on oxaliplatin, an immunogenic chemotherapeutic 
agent that is effective in aggressive prostate cancer. We show 
that mouse B cells modulate the response to low-dose oxaliplatin, 
which promotes tumour-directed CTL activation by inducing 
immunogenic cell death. Three different mouse prostate cancer 
models were refractory to oxaliplatin unless genetically or pharmacologically 
depleted of B cells. The crucial immunosuppressive 
B cells are plasmocytes that express IgA, interleukin (IL)-10 and 
programmed death ligand 1 (PD-L1), the appearance of which 
depends on TGFβ receptor signalling. Elimination of these cells, 
which also infiltrate human therapy-resistant prostate cancer, 
allows CTL-dependent eradication of oxaliplatin-treated tumours. 

Using the autochthonous transgenic adenocarcinoma of the mouse 
prostate (TRAMP) model of metastatic prostate cancer (PC), we 
examined how lymphocytes affect the response to low-dose oxaliplatin. 
Although early (≤0.2 g) tumours responded to oxaliplatin regardless 
of B cell status (Extended Data Fig. 1a, b), after reaching ≥0.7 g, 
wild-type tumours became largely resistant to ‘late’ chemotherapy 
(Fig. 1a). However, tumours arising in B-cell-deficient TRAMP;Jh−/− 
hybrid mice were oxaliplatin sensitive (Fig. 1a), although B cells had 
little effect on tumour progression and histology (Extended Data 
Fig. 1c, d). CD8+-cell-deficient TRAMP;Cd8a−/− mice bearing small 
tumours were less responsive to oxaliplatin, but large tumours were 
treatment-resistant (Fig. 1a and Extended Data Fig. 1b). Similar results 
were obtained by subcutaneous transplantation of mouse Myc-CaP 
(MC) prostate cancer cells. Whereas small MC tumours (≈100 mm³) 
were chemotherapy responsive in wild-type mice (Extended Data 
Fig. 1e, f), large MC tumours (≈350-400 mm³) shrank after oxaliplatin 
treatment only in Jh−/− (also known as Igh-J−/−) mice (Fig. 1b-d). No 
response was observed in Cd8a−/− mice. Oxaliplatin responsiveness 
was associated with enhanced caspase 3 activation, but the tumoral 
DNA damage response measured by histone H2AX phosphorylation 
was similarly activated by oxaliplatin, regardless of the host genotype 
(Fig. 1e and Extended Data Fig. 1g-i). Oxaliplatin treatment increased 
tumour-infiltrating CD45+ cells in wild-type and Jh−/− mice, but myofibroblast 
activation and CD31 infiltration were more pronounced in 
wild-type mice (Extended Data Fig. 1j-l). Low-dose oxaliplatin 
enhanced TRAMP mouse survival in a manner dependent on CTLs 
and inhibitable by B cells (Extended Data Fig. 1m, n). B cell immunodepletion 
also enhanced oxaliplatin-induced tumour regression and the 
effect was CTL-dependent (Fig. 1f). 

Oxaliplatin stimulated CD8+ cell recruitment in TRAMP and 
TRAMP;Jh−/− mice, although more tumoral CD8+ cells were found 
in the latter (Fig. 2a and Extended Data Fig. 2a). B cell deficiency also 
enhanced oxaliplatin-induced CD8+ and CD4+ cell recruitment 
into MC tumours and induction of perforin, interferon-γ (IFNγ) 
and tumour necrosis factor (TNF) in CD8+ cells (Fig. 2b-e and 
Extended Data Fig. 2b-e). MC tumours in Jh−/− mice contained 
more CD8+ cells with activated STAT1, more proliferative CD8α+ 
CD44"GrzB*Ki67~ cells and fewer ‘exhausted’? CD8a*CD44*PD- 
1° Tim-3* and CD8*BTLA cells, the presence of which in wild-type 
tumours was increased by oxaliplatin (Fig. 2f-h and Extended Data 
Fig. 2f-i). B cell immunodepletion also enhanced tumoral CTL activa- 
tion (Extended Data Fig. 2j-p). 

Oxaliplatin treatment greatly increased the number of tumoral 
B220+CD19+ B cells (Fig. 3a and Extended Data Fig. 3a, b). After 
3-4 treatment cycles, at least 40% of tumoral B cells were CD20−/low 
CD19+B220lowCD138+ plasma cells, 40-80% of which expressed IgA 
(Fig. 3b, c and Extended Data Fig. 3c-l). IgA+ B cells became detectable 
48 h after the first treatment cycle, and their abundance increased 
to nearly 80% of B220low cells after additional cycles (Extended Data 
Fig. 3g, l). When cultured ex vivo, tumoral IgA+ B cells released soluble 
IgA (Extended Data Fig. 4a). Oxaliplatin also increased serum IgA in 
both TRAMP and MC-tumour models, but had little effect on serum 
IgG (Extended Data Fig. 4b-e). Plasmocytic IgA+ cells were found 
adjacent to α-smooth muscle actin (αSMA)-expressing myofibroblasts 
(Fig. 3d), which produce CXCL13 (ref. 10). Oxaliplatin-induced IgA+ 
B cells from spleen and MC tumours expressed activation-induced 
cytidine deaminase (Extended Data Fig. 4f, g), suggesting recent 
class-switch recombination (CSR). 

The IgA CSR is mainly induced by TGFβ together with CD40L, IL-21, 
IL-10 or IL-6 (ref. 11). Indeed, oxaliplatin increased the proportion 
of tumoral B cells containing phosphorylated SMAD2/3, and induced 
Tgfb1 messenger RNA in tumours (Fig. 3e and Extended Data Fig. 4h-j). 
Oxaliplatin also increased IL-21 expression and STAT3 phosphorylation 
in tumoral B cells (Extended Data Fig. 4k, l), as well as Il10 mRNA 
in tumours, tumoral IL-10-producing B cells and IL-10 content 
per B cell (Fig. 3f, g and Extended Data Fig. 4m). Nearly 50% of 
Figure 1 | B cells inhibit oxaliplatin-induced tumour regression. a, TRAMP 
(FVB) mice (wild-type (TR-WT), TR-Jh−/− and TR-Cd8a−/−; n = 7-15 per 
group) received weekly oxaliplatin (6 mg kg−1), starting at week 16. After 
4 weeks, prostate weights were measured. Dashed red line denotes prostate 
weight of naive controls. b, Tumour growth in mice transplanted with MC cells 
and treated with oxaliplatin, as in Extended Data Fig. 1e (late treatment), or 5% 
dextrose (n = 4-9 per group). c, Weights of MC tumours after oxaliplatin or 
vehicle treatment (n = 5-7 per group). d, Mice of indicated genotypes bearing 
MC tumours (n = 7-11 per group) were treated as above. After three cycles, 
tumour volumes (mm³) were determined. e, Numbers of cleaved caspase 3 
(CC3) CD45" cells per high-magnification field (HMF; 200) in tumours 
from Extended Data Fig. 1g. f, MC tumours were inoculated into wild-type 
(left) or Cd8a−/− (right) mice. After 16 days, B cells were depleted with 
antibodies against CD19, CD20, CD22 and B220. Four days after first twice-weekly 
antibody treatment, mice received weekly oxaliplatin (n = 4-7 per 
group, total: 42), and were euthanized 3 weeks later. Results are mean ± s.e.m. 
Tumour volumes were first analysed by Kruskal-Wallis test comparing all 
groups: **P = 0.007. Mann-Whitney and t-tests were used to determine 
significance between two groups indicated, as *P < 0.05; **P < 0.01; 
***P < 0.001. The number of mice and replications for each experiment are 
also shown in Supplementary Table 1. NS, not significant. 


IgA+CD19+ plasmocytes contained IL-10 mRNA and protein 
(Fig. 3h, i and Extended Data Fig. 4n). Oxaliplatin induced the production 
of Fas ligand (Fas-L) and PD-L1 in about 50% of IgA+ plasmocytes, 
40% of which expressed both PD-L1 and IL-10 (Fig. 3j, k and 
Extended Data Fig. 3f-j). Most PD-L1+ cells expressed IgA and contained 
phosphorylated SMAD2/3 (Extended Data Fig. 4j). However, 
lymphotoxin α/β (LTα/β)-producing B cells did not express IL-10 
and their abundance was barely increased by oxaliplatin (Extended 
Data Fig. 4o, p). Tumoral CD19+ cells did not express CD5, a B 
regulatory (Breg) cell marker (Extended Data Fig. 4q). Oxaliplatin 
induced other immunoregulatory molecules, including NOS2, ARG1, 
IL-12p35 and IL-12p40, but no differences were observed between 
Figure 2 | B cells inhibit oxaliplatin-induced T cell activation. a, CD8+ 
T cells in TRAMP prostates (wild-type, Jh−/−; n = 4-6 per group) from mice 
treated as in Fig. 1a, enumerated by flow cytometry and normalized to CD45+ 
cells. Tu, tumoral. b, Mice (n = 6-8 per group) bearing MC tumours were 
analysed as above for CD8+ T cells in spleens and tumours after three 
chemotherapy cycles. c, d, Quantitative PCR with reverse transcription (qRT-PCR) 
analysis of perforin (Prf1) and Ifng mRNA in MC tumours collected as in 
b (n = 4-7). e, IFNγ expression by CD8+ cells from tumours (n = 6-8) from 
b after in vitro re-stimulation with tumour cell lysate. f-h, Expression of GrzB 
and Ki-67 (f), PD-1 and Tim-3 (g) and BTLA (h) in CD8+ T effector cells 
(CD8+CD44+; f, g) or total CD8+ cells (h) from tumours of MC-inoculated 
mice (b). Results are percentages of positive cells in tumoral CD8+ cells or 
mean fluorescence intensities (MFI) and are mean ± s.e.m. of three 
independent experiments (n = 4-8 mice per group). *P < 0.05; **P < 0.01; 
***P < 0.001 (Mann-Whitney and t-tests). 


tumour-bearing wild-type and Jh−/− mice, although the latter 
expressed higher amounts of IL-12 (Extended Data Fig. 5a-d). B cell 
deficiency or depletion had no significant effect on tumoral natural 
killer cells, myeloid CD11b+Gr1+ cells, macrophages or Treg cells 
(Extended Data Fig. 5e-i). Thus, unlike mouse skin cancer, in which 
B cells modulate therapeutic responsiveness through macrophages”, 
B cells in mouse PC impede immunogenic chemotherapy by suppres- 
sing CTL activation. 

Human PC samples (n = 110) were analysed for CD8+ and CD20+ 
cells (Extended Data Fig. 6a, b). Comparison of matched normal and 
tumour tissues from 87 patients with early-stage PC (E-PC) indicated 
higher CD8+ and CD20+ counts in tumours (Extended Data Fig. 6c, d). 
Patients with therapy-resistant PC (TR-PC) or metastatic PC (M-PC) 
exhibited reduced tumoral CD8+ cell density relative to patients with 
E-PC, the tumours of which contained fewer B cells than TR-PC and 
M-PC, in which B cells were most abundant (Extended Data Fig. 6e, f). 
E-PC specimens displayed higher CD8/CD20 ratios than specimens 
from TR-PC and M-PC (Extended Data Fig. 6g). Immunofluorescence 
and immunohistochemical analyses of human PC specimens 
revealed IgA+ cells in a scattered formation, frequently next to 
αSMA+ myofibroblasts, especially in the high-risk group (Fig. 3l 
and Extended Data Fig. 6h-j, n). CD20+ B cells were both scattered 
and clustered in lymphoid follicle-like areas (Extended Data Fig. 6b, 
k). Human PC also contained IL-10-producing IgA+CD138+ cells 
and some IgA+ cells were adjacent to CD8+ T cells and expressed 
little CD20 (Extended Data Fig. 6i-k). 25% of IgA+ cells in fresh 
prostatectomy specimens expressed IL-10 and were enriched in the 
malignant tissue portion (Extended Data Fig. 6l, m). IgA+CD138+ 
plasmocytes exhibited higher density in TR-PC and M-PC than E-PC 
and patients with higher IgA+CD138+ cell counts showed lower 
CD8/CD20 ratios (Extended Data Fig. 6n-p). Oncomine analysis of 
human IgA (IGHA1) mRNA revealed increased levels in malignant 
versus healthy prostates in 11 out of 15 data sets. Of these, five showed 
a significant increase (P < 0.05) and three showed a more than twofold change. 
Results of one analysis are presented (Extended Data Fig. 6q) and fit 
earlier findings in mice, suggesting that tumour-infiltrating lymphocytes 
also control malignant progression and response to therapy 
in human PC. 

Consistent with previous knowledge and SMAD2/3 activation in 
PD-L1+ cells, TGFBR2 ablation in B cells (Tgfbr2ΔB) enhanced oxaliplatin-induced 
tumour regression, mildly decreased tumour-infiltrating, 
but not splenic, B cells, and inhibited oxaliplatin-induced IgA+ 
plasmocyte generation without affecting IgG1+ or IgG2a+ cells 
(Fig. 4a-d and Extended Data Fig. 7a-e). IgA ablation also potentiated 
oxaliplatin responsiveness without reducing tumoral B cells (Fig. 4a, b). 
Both TGFBR2 and IgA ablations prevented induction of tumoral PD-L1+ 
or IgA+IL-10+ B cells by oxaliplatin, but barely affected IL-10 in 
B220+IgA− B cells (Extended Data Fig. 7f, g). TGFBR2 ablation or 
IgA deficiency also increased tumoral CTL density, IFNγ production 
and surface CD107a expression by CD8+ T cells of oxaliplatin-treated 
mice (Fig. 4e, f). Suppressor B cells may attenuate T cell activation via 
PD-L1 (ref. 16). Treatment of mice bearing MC tumours with oxaliplatin 
plus anti-PD-L1, but not anti-PD-L1 alone, inhibited tumour 
growth, increased GrzB expression by effector T cells, downregulated 
PD-L1 expression on IgA+ cells, and reduced serum IgA, but not IgG 
(Extended Data Fig. 7h-m). Reconstitution of tumour-bearing Jh−/− 
hosts with B cells lacking either PD-L1 or IL-10 failed to inhibit 
oxaliplatin-induced tumour regression (Fig. 4g and Extended Data 
Fig. 7n-p). PD-L1 ablation did not affect IL-10 expression, and IL-10 
ablation had no effect on PD-L1 (Extended Data Fig. 10m, n), indicating 
that both molecules are needed for plasmocyte-mediated 
immunosuppression. 

We used oxaliplatin because of its well-described immunogenic 
properties, which are not exhibited by the related compound cisplatin. 
Both oxaliplatin and cisplatin induced apoptotic cell death but 
oxaliplatin was better at stimulating autophagy (Extended Data Fig. 8a, 
b). Importantly, only low-dose oxaliplatin induced regression of MC 
tumours in Jh−/− mice, whereas low-dose cisplatin was ineffective, and 
Figure 3 | Oxaliplatin induces tumour infiltration with IgA+PD-L1+ 
IL-10-producing plasmocytes. a, B220+CD19+ B lymphocytes in 
20-week-old TRAMP mice prostates after four oxaliplatin cycles 
(n = 5-7 per group) normalized to prostate weights. b, c, B220, CD19, 
CD138 and IgA expression in tumoral B cells from a. Values are 
percentage of tumoral CD45+ (b) or CD19+ (c) cells. d, MC tumours 
(n = 4-5 per group) stained for αSMA (green) and IgA (red). Nuclei 
counterstained blue with DAPI (4’,6-diamidino-2-phenylindole). 
Arrows denote IgA+ cells, the numbers of which per HMF are displayed 
on the bottom. e, Phosphorylated SMAD2/3 (p-SMAD2/3) in 
tumour-infiltrating B cells (n = 3-4 per group). f, Il10 mRNA in MC 
tumours (n = 5-6 per group). g, Tumour-infiltrating IL-10+CD19+ B cells 
in MC-WT mice, as percentages of CD45+ cells. h, Percentages of 
IL-10-producing cells in tumoral (MC-WT) CD19+IgA+ and CD19+IgA− 
cells. i, IL-10 expression by tumoral (MC-WT) IgA+ and IgA− B cells 
(n = 4-6 per group). j, PD-L1 and Fas-L expression in B cells from 
TRAMP tumours. k, Pdl1 mRNA in MC tumours (n = 5-6 per group). 
l, Low- (n = 5) and high- (n = 5) risk human PC specimens stained with 
IgA (red) and αSMA (green) antibodies. Arrows denote IgA+ cells. 
Results are mean ± s.e.m. of at least three independent experiments. 
*P < 0.05; **P < 0.01; ***P < 0.001 (Mann-Whitney and t-tests). 


only oxaliplatin increased the abundance of tumoral CD8+ and CD4+ 
cells (Extended Data Fig. 8c-e). Low-dose oxaliplatin did not increase 
intestinal permeability and had no effect on IgA production and other 
immune parameters in tumour-free wild-type or Tgfbr2ΔB mice 
(Extended Data Fig. 8f-k). 

Immunogenic chemotherapy also potentiates the effectiveness of 
adoptively transferred T cells (ATCT). Immunogenic TRAMP-C2 
cells were inoculated into B-cell-containing Tcrb−/− mice followed 
by oxaliplatin treatment and ATCT (Extended Data Fig. 9a). Bigger 
tumours in Tcrb−/− relative to wild-type mice confirmed TRAMP-C2 
immunogenicity (Extended Data Fig. 9b). However, despite successful 
T cell take and increased CD8+ counts after oxaliplatin treatment, 
tumours were not rejected (Extended Data Fig. 9c-e). TRAMP-C2 
tumours were also raised in Rag1−/−;OT-1 mice, which lack B cells 
and polyclonal T cells but contain CD8+ cells directed against chicken 
ovalbumin (Ova). Adoptively transferred CD8+ cells expanded and 
expressed GrzB in Rag1−/−;OT-1 hosts, especially after oxaliplatin 
treatment (Extended Data Fig. 9f-h). Consequently, tumour growth 
was inhibited by ATCT combined with oxaliplatin (Extended Data 
Fig. 9i, j). More notable results were obtained in TRAMP;Rag1−/− 
mice transplanted with CFSE-labelled splenocytes from either naive 
wild-type (B and T cell transfer) or Jh−/− (T cell transfer) mice 
(Extended Data Fig. 9k). CD8+ cell proliferation in bone marrow, 
spleens and prostates of transplanted mice indicated successful T cell 
take (Extended Data Fig. 9l, m). Thirty days after lymphocyte transfer, 
prostate tumours were analysed. Oxaliplatin caused modest tumour 
shrinkage in mice receiving T and B cells, but in mice receiving only T 
cells it induced complete regression (Fig. 5a, b). Transplantation with 
T and B cells combined with oxaliplatin restored CD19+ cells in spleen 
and prostate and serum IgA and IgG, including IgA and IgG directed 
against SV40 T antigen, the TRAMP oncogene (Fig. 5c-e and 
Extended Data Fig. 9n), indicating a tumour-specific humoral response. 
Transferred B cells expressed TIM-1 (Extended Data Fig. 9o), 
a molecule involved in regulation of IL-10 expression and tolerance 
induction. B and T cell co-transplantation led to the appearance of 
Figure 4 | TGFBR signalling and IgA CSR are required for 
controls. e, Frequency of tumoral CD8+ T cells in mice from a. f, CD8+ cells 
(5 X 10° per well) from a were re-stimulated with either MC lysate (left) or 
PMA plus ionomycin (right) and analysed for indicated markers. Percentages 
of marker-positive cells within tumoral CD8+ cells are shown (n = 4-7 mice 
per group). g, B cells (5 X 10°; 98% pure) from wild-type, Pdl1/2−/− and 
Il10−/− mice were transferred into MC tumour-bearing Jh−/− mice (16 days 
after inoculation) that received oxaliplatin 2 days later. Tumour volumes 
were determined on day 30 (n = 4-6 mice per group). Results are mean ± 
s.e.m. In a-g, *P < 0.05; **P < 0.01; ***P < 0.001 (Kruskal-Wallis test, 
Mann-Whitney and t-tests). 


CD4+ and CD8+ cells in lymphoid organs, but T cell number was 
considerably lower in prostate tumours (Fig. 5f and Extended Data 
Fig. 9p-r). However, transplantation with B-cell-deficient splenocytes 
caused robust T cell infiltration into prostate tumours (Fig. 5f and 
Extended Data Fig. 9r). To confirm that IgA+ B cells attenuate the 
response to immunogenic chemotherapy by inhibiting T cell activation, 
we raised MC tumours in Rag1−/− mice and transplanted them 
with T cells from wild-type mice immunized with MC cell lysate, with 
or without naive B cells from wild-type or Tgfbr2ΔB spleens. In this 
case, oxaliplatin induced tumour regression and CTL activation only 
in mice receiving T cells, or T cells and TGFBR2-deficient B cells, 
which produced little IgA (Fig. 5g-i and Extended Data Fig. 10a-c). 
Hence, only B cells that have undergone TGFBR signalling and IgA 
CSR suppress CTL activation. 


Figure 5 | Adoptively transferred B cells inhibit T-cell-dependent tumour 
eradication. a, b, TRAMP;Rag1−/− mice (16 weeks old) received weekly 
oxaliplatin. One day after the first treatment, CFSE-labelled splenocytes from 
wild-type or Jh−/− mice were adoptively transferred into tumour-bearing 
mice (n = 4-5 per group). After three more oxaliplatin cycles, the prostates 
were photographed (a) and tumour weight was measured (b). c, Serum IgA 
in both adoptive cell transfer (ACT) groups and FVB wild-type mice. 
d, e, Serum anti-SV40-Tag IgA and IgG concentrations in indicated strains 
with or without ACT and/or oxaliplatin treatment. f, Frequency of CD8+ 
cells amongst CD45+ cells in TRAMP;Rag1−/− prostates after ACT and 
oxaliplatin treatment. g, MC tumour-bearing Rag1−/− mice were treated 
with oxaliplatin. One day later, mice (n = 4-5 per group) received activated 
T cells from wild-type mice immunized with MC cell extract without or with 
B cells from wild-type or Tgfbr2ΔB mice. After two more treatments, mice 
were euthanized and tumour volumes determined. h, IFNγ in tumoral CD8+ 
cells of above mice. Cells were re-stimulated with PMA and ionomycin 
before determining the percentages of IFNγ-expressing cells in total CD8+ 
cells (n = 4-5 per group). i, Serum IgA in above mice. Results are mean ± 
s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 (Mann-Whitney and t-tests). 
ND, not detectable. 
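
The summary statistics quoted throughout the figure legends (mean ± s.e.m. per group, with Mann-Whitney tests between two groups) can be illustrated with a short sketch. The function names and any input values are hypothetical, the s.e.m. is taken here as the sample standard deviation divided by √n, and only the Mann-Whitney U statistics are computed; converting U into a P value, as the authors' statistics software would, is left out.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, standard error of the mean) using the sample standard deviation."""
    return statistics.mean(values), statistics.stdev(values) / math.sqrt(len(values))


def mann_whitney_u(a, b):
    """Return (U_a, U_b): pairs won by each group, with ties counted as one half each."""
    u_a = 0.0
    for x in a:
        for y in b:
            if x > y:
                u_a += 1.0
            elif x == y:
                u_a += 0.5
    # Every one of the len(a) * len(b) pairs is credited to one group or split between them.
    return u_a, len(a) * len(b) - u_a
```

For example, tumour volumes from an oxaliplatin-treated group and a vehicle group could each be passed to `mean_sem` for plotting, and the pair passed to `mann_whitney_u`; a U of zero for one group means every value in it was smaller than every value in the other.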
Our results show that successful eradication of large prostate 
tumours by immunogenic chemotherapy requires removal of immunosuppressive 
IgA+ plasmocytes that are present both in mouse and 
human PC. Spontaneous and transplantable PC models contain IgA+ 
plasmocytes that strongly suppress CTL activation after treatment 
with oxaliplatin, an immunogenic cell death inducer. Although oxaliplatin 
causes regression of small tumours, it does not activate CTLs or 
shrink large prostate tumours, despite inducing DNA damage, unless 
tumour-infiltrating immunosuppressive B cells are removed. These B 
cells are IgA-producing plasmocytes that express PD-L1, IL-10 and 
Fas-L. Genetic analysis confirms that much of the immunosuppressive 
activity derives from IgA+PD-L1+IL-10+ cells. Development of these 
cells, which differ from the lymphotoxin-producing CD20+ B cells 
that infiltrate androgen-deprived prostate tumours and stimulate castrate-resistant 
prostate cancer emergence through the IKKα-BMI1 
module, depends on TGFBR signalling. Nonetheless, CD20+LT+ 
B cells that are exposed to high TGFβ concentrations and antigen in 
the PC microenvironment after oxaliplatin treatment may eventually 
become IgA+ plasmocytes. A likely source of TGFβ is αSMA+ myofibroblasts 
that reside next to IgA+ cells in oxaliplatin-treated mouse 
tumours and human PC samples. Alternatively, lymphotoxin-producing 
B cells may stimulate the IgA CSR, as signalling via LTβ receptor 
on gut stromal cells is required for IgA production. Although the 
anti-inflammatory and regulatory activities of intestinal IgA-producing 
cells, as well as other plasmocytes, are well known, this is 
the first time, to our knowledge, IgA+ plasmocytes were found to 
suppress anti-tumour immunity. 

IgA+ plasmocytes within prostate tumours induce CD8+ cell 
exhaustion and suppress anti-tumour CTL responses through PD-L1 
and IL-10, either of which can induce anergy or exhaustion. Yet, 
B cells may regulate anti-tumour immunity by other mechanisms, 
including indirect control of T cell infiltration via macrophages and 
IL-10 production by Breg cells, although the latter only affect CD4+ T 
helper cells. Notably, IL-10-expressing IgA+ cells are most abundant 
in therapy-resistant and metastatic human PC and circulating IgA 
is a well-established adverse prognostic indicator in PC. We therefore 
suggest that elimination or inhibition of tumour-infiltrating IgA+ 
plasmocytes may be the key to successful immunotherapy of PC, as 
long as an immunogenic chemotherapeutic, such as oxaliplatin, is also 
used. Immunogenic chemotherapy may also enhance response rates to 
PD-1 or PD-L1 blockade in other malignancies, including bladder 
cancer and cutaneous melanoma in which only 35% of the patients 
exhibit a response. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 
Animal models. C57BL/6 and FVB control mice were from Charles River 
Laboratories and CD45.1 mice were from the Jackson Laboratory, and all were 
bred at the University of California San Diego (UCSD) animal facility. 
C57BL/6-Tg(TRAMP)8247Ng/J (TRAMP mice) were backcrossed to the FVB strain for 
more than ten generations. The median survival of TRAMP-FVB mice was 23 
weeks compared to 52 weeks for TRAMP-C57BL/6 mice. TRAMP mice were 
crossed with B-cell-deficient (Jh−/−) mice, CTL-deficient (Cd8a−/−) mice or 
Rag1−/− mice, which lack both B and T cells, all in the FVB-background. OT-I 
mice were obtained from Taconic. Tgfbr2F/F (FVB-background) mice were 
obtained from H. Moses. Tcrb−/−, CD19-Cre, Il10−/− and CD45.1 mice were 
purchased from the Jackson Laboratory. IgA gene-deficient (Iga−/−) mice were 
obtained from Baylor College of Medicine. Pdl1/2−/− mice were obtained from 
Genentech. CD19-Cre and Iga−/− mice were backcrossed to the FVB strain for 
more than ten generations. All mice were maintained in filter-topped cages on 
autoclaved food and water at the UCSD animal facility and all experiments were 
performed in accordance with UCSD and NIH guidelines and regulations. 
Mouse treatment studies were matched design control trials. Accordingly, mice 
were randomly chosen and paired based on sex (male), age and tumour size 
(Extended Data Fig. 1a, e). For transplanted tumour models, tumour size was 
defined by the median tumour volume (for example, 400 mm³, for late treatments, 
Extended Data Fig. 1e). For TRAMP transgenic tumour models, treatment decisions 
were made based on age and mice were randomly chosen including a control 
littermate (Extended Data Fig. 1a). An identification code was assigned to each 
tumour-bearing mouse both in the transplanted and transgenic models, and the 
investigators were blinded to treatment allocation at the time of tumour volume 
measurement, autopsy and analysis. The number of mice used in each experiment 
and the number of experiments (replications) are shown in Supplementary Table 1. 
Flow cytometry and lymphocyte isolation. For lymphocytes isolated from spleen 
and lymph nodes, standard protocols using filters were used. Lymphocytes 
were isolated from human blood using Ficoll-Paque PLUS (GE Healthcare Life 
Science) according to manufacturer’s recommendations. For lymphocyte isolation 
from tumours (mouse and human), tumours were cut into small pieces and 
incubated in dissociation solution (RPMI medium supplemented with 5% FBS, 
collagenase type I (200 U ml−1), collagenase type IV (200 U ml−1), and DNase I 
(100 µg ml−1)) for 30 min at 37 °C. After incubation, cell suspensions were passed 
through a 50-µm cell strainer and washed twice. For large tumours (≥0.7 g), 
haematopoietic cells were pre-enriched using density gradient centrifugation 
(Percoll or Ficoll). Small tumours (≤0.2 g) were pooled (two tumours per staining). 
Red blood cells were lysed (RBC Lysis buffer, multi-species; eBioscience). 
For blocking of Fc-mediated interactions, mouse cells were pre-incubated with 
0.5-1 µg of purified anti-mouse CD16/CD32 (93) per 100 µl and human cells were 
incubated with FcR blocking reagent (Miltenyi Biotec). Isolated cells were stained 
with labelled antibodies in PBS with 2% FCS and 2 mM EDTA or cell staining 
buffer (Biolegend). Dead cells were excluded based on staining with Live/Dead 
fixable dye (eBioscience). For intracellular cytokine staining, cells were restimulated 
(Myc-Cap cell lysate, PMA/ionomycin or PMA/ionomycin/LPS, as indicated) 
in the presence of a protein transport inhibitor cocktail containing 
brefeldin A and monensin (eBioscience). For CD107, a staining 
antibody was added to the culture during the stimulation. After 5 h, cells were 
fixed and permeabilized with BD Cytofix/Cytoperm reagent for cytokine staining. 
BD transcription factor buffer was used for Foxp3 and T-bet staining and BD 
Phosflow was used for p-SMAD2/3 and p-STAT staining (BD Biosciences) 
according to manufacturer’s recommendations. After fixation/permeabilization, 
cells were stained with labelled antibodies of interest. Moreover, Il10 and Actb 
mRNA expression were analysed on single cell level by flow cytometry in combination 
with CD45, IgA and IL-10 protein staining, using FlowRNA II Assay kit 
(Affymetrix eBioscience) according to manufacturer’s protocols. Cells were analysed 
on a Beckman Coulter Cyan ADP flow cytometer. Data were analysed using 
FlowJo software (Treestar). Immune cell analysis of tumour-free mice of different 
genetic backgrounds (C57BL/6 and FVB) and different genetic ablations are 
shown in Extended Data Fig. 10d-p. The gating strategies and isotype controls 
for p-STAT1 and IL-10 staining are shown in Extended Data Fig. 10q-u. 
Adoptive lymphocyte transfer. For ATCT, CD8* T cells were isolated from 
single cell suspensions, prepared from spleens and lymph nodes as described 
above, using CD8qa-specific microbeads and MACS-columns (both Miltenyi 
Biotec GmbH), and 5 X 10° CD8* T cells were transferred intraperitoneally 
(Extended Data Fig. 9a-j). For adoptive B cell transfer (ABCT), B cells (B220*/ 
CD19*) were isolated from single-cell suspensions prepared from spleens using 
CD19- and B220-specific microbeads and MACS-columns, and 5 X 10° B cells 
were transferred intraperitoneally (Fig. 4g and Extended Data Fig. 7n-p). For 
adoptive splenocytes transfer (ACT), single cell suspensions prepared from 
spleens were transferred intraperitoneally, with one total spleen injected per 
mouse. Labelling with 5- (and 6-) carboxyfluorescein diacetate succinimidyl ester 
(CFSE; Molecular Probes) was done according to manufacturer’s protocol. Five 
million CD8+ T cells or 7 × 10° B cells were transferred (equal to one spleen per 
mouse; Fig. 5a-f and Extended Data Fig. 9k-r). For combined adoptive B and T 
cell transfer (Fig. 5g-i and Extended Data Fig. 10a-c), T cells were isolated from 
wild-type FVB mice immunized with a Myc-Cap cell lysate as previously 
described”’. Specifically, Myc-Cap cells were incubated with oxaliplatin (40 µM) 
for 48 h. The extent of cell death was determined by flow cytometry, showing that 
more than 90% of cells were positive for annexin V and propidium iodide. The 
dead cells were injected subcutaneously into wild-type FVB mice. Seven days later, 
T cells were isolated from single cell suspensions of spleen and lymph nodes using 
a Pan T cell isolation kit (Miltenyi Biotec). B cells were isolated from spleens of 
naive FVB wild-type or Tgfbr2ΔB mice using a Pan B cell isolation kit (Miltenyi 
Biotec). MC-tumour bearing Rag1−/− mice received 5 × 10° T cells with or 
without 5 × 10° B cells (98% pure) from wild-type or Tgfbr2ΔB mice. Purity was 
analysed on a Beckman Coulter Cyan ADP flow cytometer and was always >98%. 
Absolute numbers of particular immune cells in spleen were calculated by 
multiplying the CD45+ cell number from one spleen by the percentages of 
the particular cell type amongst CD45+ cells. Absolute numbers of particular 
immune cells (for example, CD8+ cells) in tumours were calculated by multiplying 
the cell number in one tumour portion by the percentages of the corresponding 
cell type in vital tumour cells divided by the weight of the analysed tumour 
fragment. 
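
The two absolute-number calculations described above can be sketched in a few lines of Python. This is only an illustration: the function and parameter names are hypothetical, and the tumour weight is assumed to be in grams, which the text does not state.

```python
def absolute_count_spleen(cd45_cells_per_spleen: float, percent_of_cd45: float) -> float:
    """CD45+ cell number from one spleen times the percentage of the cell type amongst CD45+ cells."""
    return cd45_cells_per_spleen * percent_of_cd45 / 100.0


def absolute_count_tumour(cells_in_portion: float, percent_of_vital: float,
                          fragment_weight: float) -> float:
    """Cell number in one tumour portion times the percentage of the cell type
    in vital tumour cells, divided by the weight of the analysed fragment.
    The weight unit (assumed grams here) sets the unit of the result."""
    if fragment_weight <= 0:
        raise ValueError("fragment weight must be positive")
    return cells_in_portion * percent_of_vital / 100.0 / fragment_weight


if __name__ == "__main__":
    # Illustrative numbers only, not data from the study.
    print(absolute_count_spleen(1e7, 12.5))
    print(absolute_count_tumour(2e6, 5.0, 0.25))
```

The result of the tumour calculation is a count per unit weight, so values from fragments of different sizes become comparable.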

Subcutaneous tumour models. Two million Myc-Cap* or 3 × 10° TRAMP-C2 
(ref. 17) cells (purchased from ATCC) were subcutaneously injected into the right 
flank. Tumours were measured every 2-3 days using a caliper. Tumour volumes 
were calculated as width² × length/2. 
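
As a worked example of the volume formula above, a minimal Python sketch follows. The function name is hypothetical, and caliper readings are assumed to be in millimetres so that the result is in mm³.

```python
def tumour_volume(width_mm: float, length_mm: float) -> float:
    """Tumour volume estimated from caliper measurements as width^2 x length / 2."""
    if width_mm < 0 or length_mm < 0:
        raise ValueError("measurements must be non-negative")
    return width_mm ** 2 * length_mm / 2.0


if __name__ == "__main__":
    # A 5 mm x 8 mm tumour gives 5^2 * 8 / 2 = 100 mm^3.
    print(tumour_volume(5.0, 8.0))
```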

Immune-mediated B cell depletion. B cells were depleted as previously 
described**. Mice were injected weekly (intraperitoneally) with a mixture of mono- 
clonal antibodies, each at 150 µg per mouse: rat anti-mouse CD19 (clone 1D3), rat 
anti-mouse B220 (clone RA3-6B2), and mouse anti-mouse CD22 (clone CY34). 
After 48 h, the mice were injected with a secondary antibody (mouse anti-rat κ 
chain; GeneTex) at 150 µg per mouse. In addition, mice were injected weekly, but 
not on the same day, with 250 µg per mouse rat anti-mouse CD20 (Genentech). 
Rat anti-mouse IgG2a and IgG1 were used as isotype controls. Mice were treated 
for 3 weeks in total (Fig. 1f and Extended Data Fig. 2j-p). 

Oncomine data analysis. In silico analysis of human IgA (IGHA1) mRNA 
expression was performed using 15 PC microarray gene data sets'*””-” from the 
Oncomine database (Compendia Biosciences; http://www.oncomine.org)** com- 
paring a combined 126 carcinoma/adenocarcinoma specimens to 30 normal 
(either benign, disease-free normal and/or normal adjacent) tissue specimens. 
Evaluation criteria were set as P < 0.05, fold change >2.0, and gene rank in the 
top 10%. 
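
The three evaluation criteria above amount to a simple conjunctive filter. The sketch below shows one way to express it in Python. The record layout and names are hypothetical and do not reflect the Oncomine export format, and treating "top 10%" as a rank percentile of at most 10 is an assumption.

```python
from dataclasses import dataclass


@dataclass
class DatasetResult:
    """One data set's result for a gene (hypothetical layout)."""
    p_value: float
    fold_change: float
    rank_percentile: float  # 1.0 means the gene is in the top 1% of ranked genes


def meets_criteria(result: DatasetResult) -> bool:
    """P < 0.05, fold change > 2.0 and gene rank in the top 10%."""
    return (result.p_value < 0.05
            and result.fold_change > 2.0
            and result.rank_percentile <= 10.0)


if __name__ == "__main__":
    # Illustrative values only, not results from the study.
    results = [DatasetResult(0.01, 3.2, 4.0), DatasetResult(0.20, 2.5, 8.0)]
    print(sum(meets_criteria(r) for r in results), "of", len(results), "data sets pass")
```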

Analysis of human specimens. Paraffin-embedded specimens from a total of 110 
PC patients were integrated into a tissue microarray system (TMA) constructed at 
the Clinical Institute of Pathology at the Medical University of Vienna (MUV). All 
of the human specimens used for TMA construction were approved by the MUV 
Research Ethics Committee (1753/2014). The cohort included 87 patients with 
E-PC, 9 patients with TR-PC, and 15 patients with M-PC. Patients’ demographic 
and histopathological features are shown in Supplementary Table 2. 

TMA were designed to provide two cores of normal prostate tissue and four 
cores of PC tissue from each E-PC patient, and 3-6 cores of tumour tissue for each 
TR-PC and M-PC patient. Stained TMA slides were digitalized by virtual micro- 
scopy at 20× magnification with a fixed light intensity and resolution into a 
bright-field image using the Nanozoomer (Hamamatsu) scanner. Computer- 
assisted image analysis of individual TMA core images was used to quantify the 
percentage of CD8+ and CD20+ immune reactive area (IRA%) as a proportion of 
the total digitized haematoxylin-stained region, as previously described™. For each 
PC patient, the mean continuous values of CD8+ and CD20+ IRA% in TMA cores 
without technical artefact for normal and tumour prostate tissue were calculated 
and used for subsequent statistical analysis. The presence of CD138+ and IgA+ 
double immunoreactivity for plasma cells in the stromal compartment or directly 
contacting a cancer cell was semiquantitatively scored in TMA cores for each 
patient by an investigator who was blinded to the patients’ tumour features. A 
value of 0 was assigned to tissue cores without evidence of stromal CD138+/IgA+ 
double immunoreactive cells and a value of 1 was recorded when CD138+/IgA+ 
double immunoreactive cells were present in the stromal compartment. 
Furthermore, after approval from the UCSD Institutional Review Board, whole 
tissue slides were subjected to immunohistochemical (IHC) analysis of αSMA+/ 
IgA, CD8+/IgA and IL-10+/IgA double staining from a cohort of formalin-fixed, 
paraffin-embedded radical prostatectomy specimens. As previously described”, 
this cohort included up to 50 patients, which were selected based on known clinical 
outcome according to risk categories of low-, intermediate- and high-risk groups 
based on the D’Amico risk classification”. 

Anonymized fresh prostatectomy and blood samples from consented human 
subjects, and de-identified clinical information were provided under the UCSD 
Moores Cancer Center Biorepository and Tissue Technology Institutional Review 
Board approved protocol and provided to investigators (M.K., C.J.K., C.A.M.J. and 
D.E.H.) with Cancer Sample Banking Committee approval. Fresh, de-identified 
samples of human prostate tissue and blood in 10-ml EDTA-coated tubes were 
collected from patients undergoing radical prostatectomy for clinically localized, 
intermediate or high risk PC, Gleason grade 3+4 or higher. A board-certified 
genitourinary pathologist (D.E.H.) collected samples of fresh prostate tumour 
and adjacent benign tissue, within 1 h of radical prostatectomy, that were 5-10 
mm in diameter. 

Immunostaining. Tissues were embedded in Tissue Tek OCT (Sakura Finetek) 
compound and snap-frozen. Tissue sections were fixed in cold acetone/methanol 
or 3% paraformaldehyde for 3-10 min and washed with PBS. Slides were blocked 
with PBS/1% normal donkey or goat serum for surface staining or 0.2% gelatin 
(from cold-water fish skin; Sigma-Aldrich)/PBS/1% normal donkey or goat serum 
for intracellular staining for 30 min. Sections were incubated with primary anti- 
bodies for 1 or 12 h at room temperature or 4 °C, respectively. After washing with 
PBS, secondary antibodies were added for 1 h at room temperature. As negative 
controls, samples were incubated with isotype-matched control antibodies or 
secondary antibodies only. After staining with DAPI, sections were covered with 
Vectashield Mounting Medium (Vector Laboratories). TMA tissue slides from 
formalin-processed and paraffin-embedded tumour sections were processed for 
immunohistochemistry. After de-paraffinization and rehydration, sections were 
immersed in a pre-heated antigen retrieval water bath with a pH 6.1 citrate buffer, 
or Dako Target Retrieval Solution for 20 min at 95-96 °C. ImmPRESS Polymer 
System Diaminobenzidine tetrahydrochloride (DAB) peroxidase substrate-based 
chromogens were used for single staining of CD8, CD20 and for IgA staining when 
combined with CD138 and for αSMA staining when combined with IgA for IHC 
of human samples. ImmPACT Vector NovaRED peroxidase substrate-based 
chromogens were used for CD138 staining when combined with IgA for IHC of 
human samples. ImmPACT Vector Red Alkaline Phosphatase substrate-based 
chromogens were used for IgA staining when combined with αSMA for IHC of 
human samples. All stainings were done according to the manufacturer’s proto- 
cols (Vector Laboratories). Nuclei were lightly counterstained with a freshly made 
haematoxylin solution then further washed in water and mounted. Sections were 
examined using an Axioplan 200 microscope with AxioVision Release 4.5 software 
(Zeiss) or TCS SPE Leica confocal microscope (Leica). 

Antibodies. Antibodies specific for the following antigens were used: monoclonal 
rabbit antibody to cleaved caspase 3 (9661) or p-γH2AX (Ser139; 20E3) (Cell 
Signaling Technology); polyclonal rabbit antibody to: CD3 (Dako, IS503); 
αSMA (Dako); Tim-3 (B8.2C12); Tim-1 (RMT1-4); p-SMAD2/3 (D27F4); 
LC3B (D11) and CD138 (syndecan-1) (anti-mouse Biolegend; anti-human 
Dako M115); IgA (mA-6E1, m11-44-2, mRMA-1, anti-mouse eBioscience/ 
Biolegend; anti-human for IHC: Dako, A0262; for FACS: Miltenyi); AID 
(MAID-2); CD8α (m53-6.7, human DAKO, C8/144B); CD45 (hOKT4); CD20 
(AISB12, hL26); CD44 (IM7); CD4 (RM4-5); B220 (RA3-6B2); CD19 (m1D3, 
hHIB19); IgM (II/41); IgD (11-26c); TNF (MP6-XT22); IFNγ (XMG1.2); GrzB 
(NGZB); CD107a (eBio1D4B); PD-1 (J43); PD-L1 (MIH5); FAS-L1 (MFL3); Ki67 
(SolA15); IgG2a (m2a-15F8); IgG1 (M1-14D12); IL-10 (mJES5-16E3; hJES3-9D7; 
IHC: hIL-10: AF-217-NA); CD69 (H1.2F3); FoxP3 (FJK-16s); CD11c (N418); 
CD11b (M1/70); MHCII (M5/114.15.2); Gr-1 (1A8-Ly6g); F4/80 (BM8) and 
NK1.1 (NKR.PIC) (all from eBioscience); CD31 (PECAM-1, MEC 13.3); CD45 
(m30-F11); p-STAT1 (pY701) and p-STAT3 (pY705) (BD Bioscience); and αSMA 
(anti-mouse ab5694; anti-human: DAKO, 1A4). The following Alexa 594-, Alexa 
647-, Alexa 488-conjugated secondary antibodies were used: donkey anti-rat IgG, 
donkey anti-rabbit IgG, donkey anti-goat IgG and goat anti-rat IgG (Molecular 
Probes, Invitrogen). 

ELISA. Anti-SV40 Tag immunoglobulin ELISA was performed as described*’. 
IgA and IgG ELISA kits were purchased from eBioscience, and used according 
to manufacturer’s protocols. Tumoral single cell suspension was prepared as 
described above in the flow cytometry section, and was washed 2-3 times with 
PBS/2 mM EDTA/2% FCS, to remove soluble IgA. Thereafter, about 3 × 10° cells/ 
24-well were plated in either 10% FCS/DMEM or Hybridoma medium (Life 
Technology). After 24 h, the supernatants were analysed for IgA content. Media 
without cells were used as controls. 

Treatment with chemotherapy or antibodies. Oxaliplatin was diluted in 5% 
dextrose and intraperitoneally injected weekly at 6 mg kg⁻¹ as indicated. Anti- 
PD-L1 antibody was intravenously injected at 10 mg kg⁻¹ once, followed by 5 mg 
kg⁻¹ bi-weekly. Mice were treated for three weeks for a total of 7 doses per animal. 


qRT-PCR analysis. Total RNA was extracted using an RNeasy Plus kit (Qiagen). 
RNA was reverse transcribed using an iScript kit (Biorad). qRT-PCR was per- 
formed using SsoFast EvaGreen supermix (Biorad) on a Biorad CFX96 machine. 
Primer sequences are listed below and generally were obtained from the NIH 
qPrimerDepot (http://mouseprimerdepot.nci.nih.gov). The relative expression 
levels of target genes were measured in triplicates and normalized against the level 
of RPL32 expression. Fold-difference (as relative mRNA expression) was calcu- 
lated by the comparative Ct method ($2^{C_t(\mathrm{RPL32}) - C_t(\text{gene of interest})}$). The following 
primers were used: 
Ifng, forward 5'-TGAACGCTACACACTGCATCT-3', reverse 5'-GACTCCTTTTCCGCTTCCTGA-3'; 
Tnf, forward 5'-GGTCTGGGCCATAGAACTGA-3', reverse 5'-CAGCCTCTTCTCATTCCTGC-3'; 
Il10, forward 5'-GGTTGCCAAGCCTTATCGGA-3', reverse 5'-ACCTGCTCCACTGCCTTGCT-3'; 
Prf1, forward 5'-TGGAGGTTTTTGTACCAGGC-3', reverse 5'-TAGCCAATTTTGCAGCTGAG-3'; 
Tgfb1, forward 5'-AAGTTGGCATGGTAGCCCTT-3', reverse 5'-GGAGAGCCCTGGATACCAAC-3'; 
Nos2, forward 5'-TCCAGGGATTCTGGAACATT-3', reverse 5'-GAAGAAAACCCCTTGTGCTG-3'; 
Arg1, forward 5'-TTTTTCCAGCAGACCAGCTT-3', reverse 5'-CATGAGCTCCAAGCCAAAGT-3'; 
Gzmb, forward 5'-CTCTCGAATAAGGAAGCCCC-3', reverse 5'-CTGACCTTGTCTCTGGCCTC-3'; 
Rpl32, forward 5'-TTGTGAGCAATCTCAGCACA-3', reverse 5'-GGGAGCAACAAGAAAACCAA-3'; 
Il21, forward 5'-CCCTTGTCTGTCTGGTAGTCATCTT-3', reverse 5'-GGAGGCGATCTGGCCC-3'; 
Il12p35, forward 5'-GAGGACTTGAAGATGTACCAG-3', reverse 5'-CTATCTGTGTGAGGAGGGC-3'; 
Il12p40, forward 5'-GACCCTGCCCATTGAACTGGC-3', reverse 5'-CAACGTTGCATCCTAGGATCG-3'; 
PD-L1 (also known as Cd274), forward 5'-TGCTGCATAATCAGCTACGG-3', reverse 5'-CCACGGAAATTCTCTGGTTG-3'. 
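
The comparative Ct calculation described above can be illustrated with a short Python sketch. It assumes that triplicate Ct values are averaged before normalization and that amplification efficiency is 100% (a doubling per cycle). Neither detail is spelled out in the text, and the function name is hypothetical.

```python
from statistics import mean
from typing import Sequence


def relative_expression(ct_target: Sequence[float], ct_reference: Sequence[float]) -> float:
    """Relative mRNA expression as 2^(Ct(reference) - Ct(target)).

    ct_target:    replicate Ct values for the gene of interest
    ct_reference: replicate Ct values for the reference gene (RPL32 in the text)
    """
    if not ct_target or not ct_reference:
        raise ValueError("at least one Ct value is required for each gene")
    return 2.0 ** (mean(ct_reference) - mean(ct_target))


if __name__ == "__main__":
    # Illustrative Ct values only: the target crosses threshold 4 cycles after
    # the reference, so its relative expression is 2^-4.
    print(relative_expression([24.0, 24.0, 24.0], [20.0, 20.0, 20.0]))
```

A lower Ct means more template, so a target that amplifies later than the reference yields a value below 1.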

Statistical analysis. Data are presented as either mean ± s.e.m. or median of con- 
tinuous values and were analysed by Student’s t-test or Mann-Whitney U test, 
respectively, for comparison of two groups. Kruskal-Wallis test was used to com- 
pare three or more groups. The log-rank (Mantel-Cox) test was used to compare 
survival curves. Fisher’s exact chi-square P values were used to calculate statistical 
significance of categorical values between groups. Two-tailed P values of ≤ 0.05 
were considered significant. Unpaired t-test-independent studies were used to 
determine the minimum sample sizes (StatsDirect Version 2.8.0). GraphPad 
PRISM software was used for statistical analyses. No statistical methods were used 
to predetermine sample size. The experiments were not randomized. Mouse treat- 
ment studies were matched design control trials (see ‘Animal models’). 
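
For readers unfamiliar with the summary statistics named above, the following Python sketch computes mean ± s.e.m. and the median for one group using only the standard library. The function names are hypothetical, the s.e.m. is assumed to be the sample standard deviation divided by the square root of n, and the analyses in the study itself were run in GraphPad PRISM rather than with code like this.

```python
import math
import statistics
from typing import Sequence, Tuple


def mean_sem(values: Sequence[float]) -> Tuple[float, float]:
    """Return (mean, s.e.m.), with s.e.m. = sample standard deviation / sqrt(n)."""
    if len(values) < 2:
        raise ValueError("at least two values are needed to estimate the s.e.m.")
    m = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return m, sem


def group_median(values: Sequence[float]) -> float:
    """Median of the continuous values in one group."""
    return statistics.median(values)


if __name__ == "__main__":
    # Illustrative values only, not data from the study.
    group = [2.0, 4.0, 6.0]
    m, sem = mean_sem(group)
    print(f"mean ± s.e.m.: {m:.2f} ± {sem:.2f}; median: {group_median(group):.2f}")
```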
Extended Data Figure 1 | Treatment schemes and characterization of 
tumours and mouse survival before and after treatment. a, Early and late 
treatment schemes for TRAMP mice. b, TRAMP mice (n = 3-6 per group) 
were subjected to early oxaliplatin treatment as described in a and prostate 
weights were determined at 14 weeks, 1 week after completion of four treatment 
cycles. Dashed red line indicates prostate weight of tumour-free controls (n = 
33 in total). Prostate weight in b is shown in a log scale. c, d, Histopathology of 
TRAMP tumours. c, Representative images of haematoxylin-and-eosin-stained 
prostate sections from TRAMP mice are shown. Scale bars, 100 µm. PD, poorly 
differentiated adenocarcinoma; PIN, prostatic intraepithelial neoplasia; WD, 
well differentiated adenocarcinoma. d, Histopathological assessment of early 
and late treated TRAMP tumours in wild-type and Jh−/− mice without or with 
oxaliplatin treatment. The percentages of the different histotypes shown in c are 
depicted (n = 3-7 per group). Fisher’s chi-square analysis was used to calculate 
statistical significance. e, Early and late treatment schemes for mice bearing 
subcutaneous (s.c.) MC tumours. f, MC cells were subcutaneously transplanted 
into wild-type and Jh−/− mice (n = 3-7 per group) that were subjected to early 
oxaliplatin treatment when tumour volume was 100 mm³. Forty-eight hours 
after completion of three treatment cycles, mice were euthanized and tumour 
volumes (mm³) were measured (n = 19 in total). g, MC tumours from 
indicated mice were stained for CD45 (green) and cleaved caspase 3 (CC3; red) 
(n = 4-6 per group). h, i, MC tumours (n = 3-5 per group) grown in wild-type, 
Jh−/− and Cd8a−/− mice were stained for CD45 (green) and γH2AX (red), and 
the γH2AX+ foci in CD45− cells were enumerated (i). Scale bars, 100 µm. All 
results are mean ± s.e.m. j, Representative images of subcutaneous MC 
tumours (n = 5-6 per group) from wild-type and Jh−/− mice, with or without 
oxaliplatin treatment stained for αSMA (red) and CD31 (green). k, l, Frequency 
of αSMA- (k) and CD31- (l) positive cells within tumours from j. Shown are 
median values ± s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 (Mann-Whitney 
and t-tests). m, TRAMP mice (wild-type, Cd8a−/− or Jh−/−; n = 6-14 per 
group) were treated weekly with low-dose oxaliplatin. Moribund mice were 
euthanized, and survival was compared by Kaplan-Meier analysis and 
significance was determined (wild-type: P > 0.05; Cd8a−/−: P > 0.05; Jh−/−: 
**P < 0.002). n, Survival curves for the different TRAMP groups before and 
after oxaliplatin treatment. Significant differences in survival times are 
indicated on the right. No statistically significant differences in survival were 
found between wild-type and Jh−/− or Jh−/− and Cd8a−/− mice without 
treatment. Significant differences in survival times were observed between all 
three oxaliplatin-treated groups (wild-type, Cd8a−/− or Jh−/−; indicated on the 
right). *P < 0.05; **P < 0.01; ***P < 0.001 (Kaplan-Meier analysis). The 
number of mice used in each experiment and the number of experiments 
(replications) are shown also in Supplementary Table 1. 
Extended Data Figure 2 | B cells attenuate oxaliplatin-triggered CTL 
activation. a, Flow cytometry of CD8+ T lymphocytes in prostates of 20-week- 
old TRAMP mice after four cycles of oxaliplatin treatment (n = 4-6 per group) 
normalized to prostate weights. b, Late subcutaneous MC tumours from wild- 
type and Jh−/− mice were stained for CD8 and analysed by immunofluorescent 
microscopy. In the top left areas (white square), single CD8 staining (green) 
without DAPI counterstain is shown. Tumoral CD8+ cells were counted in 3-4 
HMF (200×) per tumour (n = 4-5 tumours per group). Scale bars, 100 µm. 
c, d, Late subcutaneous MC tumours were analysed by flow cytometry for 
CD4+ lymphocytes in spleens (c) and tumours (d) after three oxaliplatin 
treatment cycles (n = 4-7 per group). The results show percentages of CD4+ 
cells in the CD45+ population. e, Flow cytometric analysis of TNF and IFNγ 
expression by CD8+ cells in MC tumours from wild-type and Jh−/− mice 
treated as above (n = 6-8) and re-stimulated in vitro with tumour cell lysate. 
f, Flow cytometry of STAT1 phosphorylation in CD8+ cells from MC tumours 
of treated and untreated wild-type and Jh−/− mice (for isotype controls, see 
Extended Data Fig. 10u). The results are summarized in the right panel (n = 3 
mice per group). g, Expression of GrzB and Ki67 in CD8+ T effector cells 
(CD8+CD44+) from spleens of MC inoculated mice after oxaliplatin 
treatment. h, i, Flow cytometry of PD-1 and Tim-3 expression by CD8+ 
T effector cells (CD8+CD44+ cells) in spleen (h) and MC tumours (i) as 
indicated with or without oxaliplatin treatment. Shown are percentages of the 
corresponding CD8+ T cells in the CD8+CD44+ population (n = 3-5 per 
group). j, The experimental scheme for B cell immunodepletion in tumour- 
bearing mice. MC tumours were raised in wild-type or Cd8a−/− mice, 16 days 
after subcutaneous tumour cell inoculation. B cells were depleted by twice 
weekly administration of antibodies directed against CD19, CD20, CD22 and 
B220. Four days after first antibody treatment, mice were treated with 
oxaliplatin (n = 4-7 mice per group, total: 44). After three weekly 
chemotherapy cycles, mice were euthanized. k, Flow cytometry analysis of 
tumour-infiltrating CD45+CD8+ T cells stained for IFNγ (left) or IFNγ and 
TNF (right) after in vitro restimulation with PMA and ionomycin (n = 4-6 
mice per group). l-n, Flow cytometry analysis of CD19+ (l, m) and IgA+ 
(n) cells in spleens and tumours isolated from the wild-type mice described 
above, confirming depletion of CD19+ B cells and oxaliplatin-induced IgA+ 
cells in spleen and tumours. o, Serum IgA concentrations in the mice described 
in j (n = 3-5 per group). p, Flow cytometry analyses of CD19+ B cells in 
tumours isolated from Cd8a−/− mice subjected to B cell depletion or not, con- 
firmed the efficient depletion of tumoral CD19+ B cells. All results are mean ± 
s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 (Mann-Whitney and t-tests). 
Extended Data Figure 3 | Immunogenic chemotherapy induces tumour 
infiltration by immunosuppressive CD19+CD20lowB220lowIgA+ B cells. 

a, b, MC tumours (n = 4-9 per group) raised in wild-type mice without or with 
oxaliplatin treatment were stained for B220 (a), and tumour-infiltrating B220* 
cells per HMF were enumerated (b). In a, single B220 staining (above) and 
combined staining B220/DAPI (below) are shown. Scale bars, 100 µm. c, d, The 
flow cytometry plots and gating strategy for analysis of splenic B cell 
populations using CD19, IgA, B220, CD138 and CD20 antibodies. Results from 
wild-type mice bearing MC tumours are shown in c and from oxaliplatin- 
treated mice in d (n = 8 mice per group). Oxaliplatin treatment modestly 
increased the amount of splenic IgA+ cells. Splenic IgA+ cells expressed CD138 
as expected and showed lower levels of B220 and CD20, in either control or 
oxaliplatin-treated mice. e, The gating strategies for analysis of tumoral B cells 
using CD19, IgA, B220 and CD138 antibodies. Results from MC tumours in 
two representative oxaliplatin-treated wild-type mice are shown (n = 8 mice 
per group), demonstrating the presence of IgA+ cells in oxaliplatin-treated 
tumours with a typical CD138+B220low phenotype. f-i, Flow cytometry plots 
and gating strategies for analysis of tumoral B cell populations using CD19, 
B220, CD138, IgA and PD-L1 antibodies. Results from wild-type mice bearing 
MC tumours without (f) or with (g) oxaliplatin treatment (n = 6 mice per 
group) and Iga−/− mice bearing MC tumours without (h) or with (i) oxaliplatin 
treatment (n = 6 mice per group) are shown. Oxaliplatin treatment increased 
the amount of tumoral IgA+CD138+B220lowPD-L1+ cells in wild-type mice. 
j, Flow cytometric analysis of PD-L1 and IL-10 expression in IgA+B220−/low 
and B220+IgA− B cells from oxaliplatin-treated TRAMP tumours (n = 4). 
k, Flow cytometric analysis of IgA and CD138 expression by TRAMP tumour- 
infiltrating B cells. Shown are percentages of IgA+ cells among all tumour- 
infiltrating CD19+CD138+ cells. l, Wild-type mice bearing MC tumours were 
treated with oxaliplatin as above. Two days after the first or last oxaliplatin 
cycle, mice were euthanized, tumours were isolated and analysed by flow 
cytometry as indicated (n = 6 per group). After dead-cell exclusion, tumour- 
infiltrating B cells were stained with CD19, CD20, B220, IgA and IgM 
antibodies. Shown are the results for control (left), one cycle (middle), and three 
cycles (right) of oxaliplatin treatment, demonstrating the presence of tumoral 
IgA+ cells with a CD19+CD20lowB220lowIgA+ cell phenotype within 48 h after 
oxaliplatin treatment. 
Extended Data Figure 4 | Immunogenic chemotherapy induces tumoral 
and systemic IgA production through CSR. a, Ex vivo analysis of IgA released 
by tumour single cell suspension isolated from oxaliplatin-treated TRAMP 
tumours. Single cell suspension from non-treated tumours and culture medium 
without cells were used as controls. b, c, Serum IgA (b) and IgG (c) in treated 
and untreated TRAMP mice and age-matched naive FVB controls (n = 7-14 
per group). d, Serum IgA amounts in control or oxaliplatin-treated mice 
bearing MC tumours (n = 5-7 per group) were determined and compared to 
age-matched naive FVB controls (n = 7). e, Serum IgG amounts in control or 
oxaliplatin-treated mice bearing MC tumours (n = 5-7 mice per group) were 
determined and compared to age-matched naive FVB controls. a-e, Results are 
mean ± s.e.m. Mann-Whitney and t-tests were used to calculate statistical 
significance. f, g, Immunofluorescence analysis of activation-induced cytidine 
deaminase (AID, green) and IgA (red) expression in spleen (f, used as a positive 
control) and MC tumours from oxaliplatin-treated wild-type mice (g). Scale 
bars: 10 µm (f, left and g) and 100 µm (f, right). Arrows point to IgA+AID+ 
cells. Shown are representative results of spleens and tumours isolated from 
four mice per group. h, RT-PCR analysis of Tgfb1 mRNA in MC tumours 
raised in wild-type or Jh−/− mice without or with oxaliplatin treatment 

(n = 3-7 mice per group). Results are mean ± s.e.m. i, Flow cytometry of 
SMAD2/3 phosphorylation in MC tumour-infiltrating B cells from wild-type 
mice before and after oxaliplatin treatment (n = 4 per group). Shown are the 
MFI and percentages (see Fig. 3e). j, Flow cytometry of SMAD2/3 phosphory- 
lation and PD-L1 in MC tumour-infiltrating B cells from wild-type mice before 
and after oxaliplatin treatment (n = 4 per group). Shown are the percentages of 
PD-L1+ p-SMAD2/3+ cells within CD45+CD19+ cells. k, RT-PCR analysis of 
Il21 mRNA in MC tumours raised in wild-type or Jh−/− mice without or with 
oxaliplatin treatment (n = 4-5 mice per group). Chemotherapy induced Il21 
mRNA mainly in wild-type mice. l, m, Flow cytometry of tumour-infiltrating B 
cells stained for p-STAT3 and IL-10 (n = 5-6 per group) before and after 
oxaliplatin treatment. n, Flow cytometry analysis of Actb (β-actin) mRNA, IL-10 
protein and Il10 mRNA in MC tumour-infiltrating IgA+ cells using PrimeFlow 
RNA technology (pooled data of 4 mice per group, after oxaliplatin treatment). 
Left, Actb mRNA gated on CD45+ cells; middle, Il10 mRNA and IL-10 protein 
expression after 1 h stimulation with PMA plus ionomycin and LPS gated on 
IgA+ cells; right, Il10 mRNA and IL-10 protein expression after 5 h 
stimulation with PMA plus ionomycin and LPS, gated on IgA+ cells. o, p, Flow 
cytometric analysis of tumour-infiltrating B cells in TRAMP mice (n = 4-5 per 
group) stained for CD19, B220, IL-10 and LTβ (o). The percentage of tumour- 
infiltrating LTβ cells amongst all tumour-infiltrating B cells was determined 
(p). q, Flow cytometric analyses of CD5 expression by B cells from spleen 

and MC-tumour of wild-type mice after oxaliplatin treatment (n = 4-5 per 
group). Shown are mean ± s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 
(Mann-Whitney and t-tests). 
Extended Data Figure 5 | Immunogenic chemotherapy or B cell deficiency 
has marginal effects on Treg, natural killer and myeloid cells. a, b, qRT-PCR 
analyses of Nos2 (a) and Arg1 (b) mRNA content of MC tumours (n = 4-7 mice 
per group). Chemotherapy induced Nos2 and Arg1 expression in wild-type and 
Jh−/− mice and no significant and consistent differences were found between 
both groups. c, d, RT-PCR analyses of Il12p40 (c), Il12p35 (d) mRNA in MC 
tumours grown in wild-type and Jh−/− mice (n = 4-6 mice per group). 
e-i, Flow cytometry analyses of tumour-infiltrating or splenic lymphocytes and 
monocytes: tumoral NK1.1+ cells (e), tumoral CD11b+CD11c+MHCII+ cells 
(f), tumoral CD11b+Gr-1+ cells (g), CD4+FoxP3+ cells (splenic, h; tumoral, 
i). Cells in e-i are from tumour-bearing mice subjected to oxaliplatin treatment 
and/or B cell depletion as indicated (B cell depletion plus oxaliplatin; n = 4-6 
mice per group). Results are mean ± s.e.m. Mann-Whitney and t-tests were 
used to calculate statistical significance. 
Extended Data Figure 6 | Analyses of B and T cells in human prostate cancer 
specimens. a-g, Tissue microarrays of tumour and non-tumour tissue from 
110 PC patients were stained for CD8 and CD20 (5-6 spots per patient = 3-4 
tumour tissue and 2 non-tumour tissue). a, b, Representative examples of CD8 
(a) and CD20 (b) IHC of PC tissue microarrays (left). Right, computer assisted 
image analysis with ad hoc developed image software. Tumour tissue is 
represented in yellow and CD8+ and CD20+ cells are represented in red. The 
percentages of immune reactive area (IRA) occupied by CD8+ or CD20+ cells 
are shown. Scale bars, 200 µm. c, d, Comparison of CD8 and CD20 IRAs in 
matched non-tumour and tumour tissues from each E-PC patient (n = 87). 
e-g, Patients were divided into three subgroups: E-PC (n = 86), TR-PC (n = 9) 
and M-PC (n = 15). e, CD8+ cell infiltration into tumour tissues of the different 
groups. f, CD20+ cell infiltration into tumour tissues of the different groups. 
g, The CD8+/CD20+ ratio for the different groups. Each dot represents one 
patient. Line indicates the median value. Mann-Whitney test was used to 
calculate statistical significance between the two groups. Kruskal-Wallis test 
was used to calculate statistical significance between the three groups. h, IHC 
analysis of low-risk (n = 5) and high-risk (n = 5) human PC specimens using 
IgA (red) and αSMA (black). Nuclei were counterstained with haematoxylin. 
Scale bar, 100 µm. i, Immunofluorescence analysis of human PC showing IL-10- 
expressing (red) IgA+ (green) CD138+ (turquoise) plasma cells (n = 6). 
Representative images are shown. White arrow indicates IL-10-expressing 
IgA+ cells. Scale bars, 50 µm. j, Human normal prostate (n = 3-5) and human 
PC (n = 5) were stained for IgA and CD8. Typical images are shown. Red and 


green arrows indicate IgA+ and CD8+ cells, respectively. Scale bar, 100 µm. 
k, Human normal prostate (n = 3) and human PC (n = 5) were stained for IgA 
(red arrow) and CD20 (green arrow). Scale bar, 100 µm. l, Flow cytometric 
analysis of human prostate tumour-infiltrating CD19+ cells and IgA+ cells. 
The percentages of IL-10-expressing B cells in CD19 IgA” (two different 
samples) and CD19" IgA” B cells are shown. m, Summary of results obtained 
from human blood samples taken from healthy donors (n = 3) and patients 
with PC (n = 5) and prostate tissue specimens (benign, malignant; n = 4) 
analysed by flow cytometry for IL-10 expression in CD19+IgA− and 
CD19+IgA+ B cells. n, o, Tissue microarrays from 110 patients with PC 
(described above) were stained for IgA and CD138. Patients were divided into 
three subgroups: E-PC (n = 86), TR-PC (n = 9) and M-PC (n = 15). 
n, Representative images of IgA (immunoperoxidase) and CD138 (alkaline 
phosphatase) double staining of tumour tissues from each group. CD138+ and 
IgA+ double-positive cells in the PC stroma are indicated by the white arrows 
(haematoxylin counterstain). Scale bar, 100 µm. o, Frequencies of IgA+ and 
CD138+ double-positive cells in the tumour stroma of the different PC patient 
groups. p, PC patient specimens were divided into two groups: IgA−/low (n = 
64) and IgA+ (n = 46). Shown is the CD8+/CD20+ ratio for each group. 
Each dot represents one patient. Line indicates the median value. q, IgA mRNA 
expression (IGHA1) is significantly increased in human PC tissue relative to 
healthy or benign prostate tissue in 5 out of 15 studies evaluated via Oncomine. 
Results from one significant study’ are presented. *P < 0.05; **P < 0.01; 
***P < 0.001 (chi-square test and Fisher’s exact test). 
Extended Data Figure 7 | Effects of TGFBR2, IgA, PD-L1 and IL-10 
ablations on tumour-infiltrating lymphocytes. MC tumours were raised in 
wild-type, Tgfbr2ΔB or Iga−/− mice (n = 5-11 per group). Mice were subjected 
to three cycles of late oxaliplatin treatment after which splenic and tumoral B 
cells were analysed. After dead cell exclusion, splenic (Spl; a, b) and tumoral 
(Tu; c, d) B cells were stained with CD19, B220, IgA, IgG2a and IgG1 antibodies 
and analysed by flow cytometry. e, Serum IgG concentrations in control or 
oxaliplatin-treated wild-type, Tgfbr2ΔB or Iga−/− mice bearing MC tumours (n 
= 5-9 per group). f, Flow cytometry of tumour-infiltrating CD19+ B cells from 
wild-type, Tgfbr2ΔB or Iga−/− MC tumour-bearing mice (n = 4-7 per group) 
analysed for PD-L1 expression, revealing lower PD-L1 surface expression on 
Tgfbr2ΔB and Iga−/− B cells after oxaliplatin treatment. g, Flow cytometry of 
tumour-infiltrating B220hi B cells (left) and IgA+B220low B cells (right) from 
wild-type, Tgfbr2ΔB or Iga−/− MC tumour-bearing mice (n = 4-7 per group) 
analysed for IL-10 expression, revealing no difference in IL-10 expression by 
B220hiIgA− B cells in the corresponding groups, and lower IL-10 expression by 
Tgfbr2ΔB B cells after oxaliplatin treatment compared to wild-type mice. Results 
are mean ± s.e.m. Mann-Whitney and t-tests were used to calculate statistical 
significance. h, The experimental scheme. Wild-type mice bearing MC tumours 
were divided into four treatment groups (n = 7-8 per group): (1) isotype 
control (IgG2a), (2) oxaliplatin (weekly), (3) anti-PD-L1 (twice weekly), and (4) 
oxaliplatin plus anti-PD-L1 (weekly and twice weekly, respectively). After three 
treatment cycles, mice were euthanized and analysed. i, Tumour growth curves 
of tumour-bearing mice and gross appearance of untreated and treated mice. 
Significance was determined by Mann-Whitney and t-tests. j, Flow cytometric 
analysis for GrzB expression by tumour-infiltrating CD8⁺ T effector cells 
(CD8⁺CD44⁺) from MC tumour-bearing mice treated as described above. 
Results are shown either as percentages of GrzB⁺ cells amongst CD8⁺ T cells 
(black), or percentages of GrzB⁺CD8⁺CD44⁺ T cells amongst tumoral CD45⁺ 
cells (red). k, Flow cytometry of PD-L1 expression on tumour-infiltrating IgA⁺ 
CD19⁺ B cells in the different treatment groups. l, m, Serum IgA (l) and IgG 
(m) concentrations in the different treatment groups described in h. n, The 
experimental scheme for the experiment whose results are shown in Fig. 4g. B 
cells were isolated from wild-type, Pdl1/2−/− and Il10−/− mice and 5 X 10° cells 
(purity 98%) were intraperitoneally transferred into MC tumour-bearing 
Jh−/− mice (16 days after MC cell inoculation). After 2 days (day 18), the mice 
were given three oxaliplatin treatment cycles and analysed. o, Flow cytometric 
analysis of splenocytes after staining with CD45 and CD19 antibodies, 
confirming presence of B cells in the ABCT groups. Shown are percentages and 
absolute B cell numbers in spleen. p, Tumour-infiltrating CD8⁺ cells from MC 
tumour-bearing Jh−/− mice transplanted with B cells and treated as above were 
re-stimulated for 4 h with PMA and ionomycin before flow cytometry (n = 4-6 
mice per group). Results are mean ± s.e.m. *P < 0.05; **P < 0.01; ***P < 
0.001 (Mann-Whitney and t-tests). 
Extended Data Figure 8 | Low-dose cisplatin treatment is devoid of 
immunogenic activity and low-dose oxaliplatin does not affect gut barrier 
function. a, Flow cytometry of MC cells stained with annexin V and propidium 
iodide 24 h after treatment with either oxaliplatin or cisplatin (both at 20 μM). 
b, Flow cytometry analysis of MC cells treated as above and stained with 
antibody to the autophagy marker LC3A. c-e, MC tumours were raised in 
wild-type and Jh−/− mice until 400 mm³ in size, after which the mice were treated 
with either cisplatin or oxaliplatin at 6 mg kg⁻¹ (n = 4-5 per group). After three 
weekly chemotherapy cycles, mice were euthanized. c, Tumour weights; left: 
wild-type mice; right: Jh−/− mice. d, e, Flow cytometry of tumour-infiltrating 
CD8 (d) and CD4 (e) cells. Left: wild-type mice, right: Jh−/− mice. f, Gut 
permeability was measured in wild-type mice before and after low- (LD) and 
high- (HD) dose oxaliplatin treatment using orally administered fluorescein 
isothiocyanate (FITC)-dextran. Shown are FITC-dextran concentrations in 
serum (μg ml⁻¹) (n = 5 mice per group). g, Serum IgA concentrations in naive 
wild-type (FVB) and Tgfbr2ΔB mice before and after oxaliplatin treatment 
(n = 3-6 per group). h, IgA staining of colon sections of untreated or low-dose 
oxaliplatin-treated wild-type mice. Scale bars, 100 μm. i-k, Flow cytometry of 
CD8⁺ (i), CD4⁺ (j) and NK1.1⁺ (k) cells in spleens of naive wild-type and 
Tgfbr2ΔB mice without or with oxaliplatin treatment. All results are mean ± 
s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 (Mann-Whitney and t-tests). 
Extended Data Figure 9 | Immunogenic chemotherapy supports adoptive T 
cell transfer only in the absence of B cells. a, The experimental scheme. 
Immunogenic TRAMP-C2 cells were subcutaneously inoculated into wild-type 
or Tcrb−/− mice. After 30 days, the mice were divided into four groups (n = 4-5 
per group): (1) control, (2) oxaliplatin (weekly), (3) ATCT, and (4) ATCT plus 
oxaliplatin (weekly). The first oxaliplatin cycle was given at day 31. Two days 
after the second cycle, CD8⁺ T cells from CD45.1 × CD45.2 wild-type mice (3 
X 10° cells) were transferred into tumour-bearing mice and this was followed 
by two more oxaliplatin cycles after which mice were euthanized for analysis on 
day 59. b, Tumour volumes (mm³). c, d, Flow cytometric analysis of spleen 
(c) and tumour (d) cells after staining with CD45.1, CD45.2, CD8 and TCRαβ 
antibodies, confirming expansion of adoptively transferred T cells. e, Tumour 
growth curves. f, The experimental scheme. Immunogenic TRAMP-C2 cells 
were subcutaneously inoculated into wild-type or Rag1−/−;OT-1 mice (no B 
cells), that contain CD8⁺ T cells specific for chicken ovalbumin which is not 
expressed by TRAMP-C2 cells. After 30 days, tumour-bearing Rag1−/−;OT-1 
mice were divided into four groups (n = 3-4 mice per group): (1) control, (2) 
oxaliplatin treatment, (3) ATCT, and (4) oxaliplatin treatment plus ATCT. The 
first oxaliplatin cycle was given at day 31. Two days after the second oxaliplatin 
cycle, CD8⁺ T cells (3 X 10°) from CD45.1 × CD45.2 mice were adoptively 
transferred into tumour-bearing mice, which were euthanized on day 59 and 
analysed. g, Flow cytometric analysis of tumour-infiltrating cells stained with 
CD45.1, CD45.2, CD8 and TCR antibodies, confirming infiltration of 
adoptively transferred T cells. h, Flow cytometric analysis of GrzB expression in 
adoptively transferred, tumour-infiltrating, CD8⁺ T effector cells 
(CD45.1⁺CD8⁺CD44⁺) from tumour-bearing mice treated as above. 
i, Tumour volumes (mm³). j, Tumour growth curves. k, The experimental 
scheme for Fig. 5a-f. Sixteen-week-old TRAMP;Rag1−/− mice (no B and T 
cells) were treated with oxaliplatin (weekly). One day after the first treatment 
cycle, CFSE-labelled splenocytes from either wild-type (B and T cells, SP-WT) 
or Jh−/− (T but no B cells, SP-Jh−/−) mice were transferred into the 
tumour-bearing mice (5 X 10° T cells per mouse; 4-5 mice per group). l, m, After 6 days, 
one mouse from each group was euthanized, and the proliferation of CD8⁺ 
(l) and CD4⁺ (m) cells in bone marrow (BM), spleen and prostates was 
analysed by CFSE staining and flow cytometry. n-r, After three more 
oxaliplatin cycles (4 weeks in total), the mice were euthanized and analysed. 
n, Frequency of adoptively transferred CD19⁺ cells amongst CD45⁺ cells in 
spleens and prostates 30 days after ACT. o, Flow cytometric analyses of CD19⁺ 
B lymphocytes for TIM-1 expression in spleens (left) and prostates (right) of 
above mice. p-r, Flow cytometric analyses of T cells. Percentages of CD8⁺ and 
CD4⁺ cells in lymph nodes (LN; p); spleens (q); prostates (r) of above 
TRAMP;Rag1−/− mice. Red: splenocytes from wild-type mice (T and B cell 
transfer), blue: splenocytes from Jh−/− mice (T cell transfer). Results are mean ± 
s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 (Mann-Whitney and t-tests). 
Extended Data Figure 10 | Immunogenic chemotherapy supports adoptive 
T cell transfer only in the absence of B cells and analysis of lymphocytes and 
monocytes in tumour-free mice. a, The experimental scheme for Fig. 5g-i. 
MC tumour-bearing Rag1−/− mice (no B and T cells) were treated with 
oxaliplatin (weekly). One day after first oxaliplatin treatment, 5 X 10° T cells 
(negative selection) from wild-type mice immunized with MC cell lysate were 
adoptively transferred into tumour-bearing mice (4-5 mice per group), alone 
or in combination with 5 X 10° B cells from wild-type or Tgfbr2ΔB mice (purity 
98%). After two more oxaliplatin cycles (3 weeks total), the mice were 
euthanized and analysed. b, Serum IgG analysis of above mice. c, Flow 
cytometric analysis of splenocytes after staining with CD45 and CD19 
antibodies. d-p, Wild-type, Jh−/−, Iga−/− and Tgfbr2ΔB mice in the FVB 
background and wild-type, Pdl1/2−/−, Il10−/− and Iga−/− in the C57BL/6 
background were analysed for the distribution of immune markers. d, Spleen 
weights of wild-type, Jh−/− and Tgfbr2ΔB mice in the FVB background. e, Flow 
cytometry of splenocytes for the following markers: CD3 (left), CD8 (middle) 
and CD4 (right), gated on the splenic CD45⁺ population. f, Absolute cell 
numbers of splenic CD3⁺ (left), CD8⁺ (middle) and CD4⁺ (right) cells are 
shown (percentage × cell count of whole spleen). g, h, Flow cytometry for TNF 
and IFNγ in CD8⁺ cells from tumour-free wild-type, Jh−/−, Tgfbr2ΔB and 
Iga−/− mice (n = 6-8) that were re-stimulated in vitro with PMA and 
ionomycin and the representative flow cytometry panels (h). i, j, Flow 
cytometry of splenocytes from wild-type and Tgfbr2ΔB for: CD19⁺IgM⁺ cells 
(i) and IgA (j) gated on the splenic CD45⁺ population. k-n, Flow cytometry of 
splenocytes from wild-type, Pdl1/2−/−, Il10−/− and Iga−/− mice for: 
CD45⁺CD19⁺IgM⁺ cells (k), CD45⁺IgA⁺ cells (l), PD-L1 expression by 
CD19⁺IgM⁺ cells (m), and IL-10 expression by CD19⁺ cells (n), as indicated. 
o, p, Serum IgA and IgG concentrations were analysed in wild-type, Pdl1/2−/− 
and Il10−/− mice (n = 4-5 mice per group). q-u, The different gating strategies 
and staining controls are shown. q, Gating strategies for tumour-infiltrating 
lymphocytes: lymphocyte gate, dead cell exclusion, doublets exclusion, and 
gating on the CD45⁺ population. r, Flow cytometric analysis of IL-10 and IgA 
expression, gated on the CD45⁺ population: (1) isotype control (no staining); 
(2) non-stimulated splenocytes: showing IgA staining, but not IL-10; (3) 
stimulated splenocytes from Il10−/− mice showing IgA staining, but not IL-10; 
and (4) stimulated splenocytes from wild-type mice showing IgA and IL-10 
staining. s, Flow cytometric analysis of IL-10 and CD19 expression, gated on the 
CD19⁺B220⁺ population. Left, stimulated cells from Il10−/− mice, showing B 
cell staining, but not IL-10; right: stimulated cells from wild-type mice showing 
B cell staining and IL-10 staining. t, Flow cytometric analysis of IL-10 and IgA 
expression, gated on the IgA⁺ population: left: stimulated cells from Il10−/− 
mice, showing IgA cell staining, but not IL-10. Right, stimulated cells from 
wild-type mice showing IgA and IL-10 staining. These results confirm IL-10 
production by IgA⁺ cells. u, Flow cytometric analysis of p-STAT1 staining with 
corresponding isotype control. All results are mean ± s.e.m. *P < 0.05; **P < 
0.01; ***P < 0.001 (Mann-Whitney and t-tests). 
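Panel f of this legend reports absolute cell numbers as the gated percentage multiplied by the cell count of the whole spleen. The arithmetic can be sketched as below. This is a minimal illustration in Python and not the authors' analysis code; the function name `absolute_count` and the example numbers are hypothetical.

```python
def absolute_count(percent_positive: float, total_cells: float) -> float:
    """Convert a gated percentage into an absolute cell number.

    percent_positive: percentage of cells inside the gate (0-100).
    total_cells: total cell count of the whole organ (for example, spleen).
    """
    if not 0.0 <= percent_positive <= 100.0:
        raise ValueError("percentage must lie between 0 and 100")
    if total_cells < 0:
        raise ValueError("total cell count cannot be negative")
    return percent_positive / 100.0 * total_cells


# Hypothetical values chosen only to show the calculation:
# 25% gated cells in a spleen of 8e7 cells.
example_cells = absolute_count(25.0, 8.0e7)
```

Because the result scales with organ cellularity, two groups with equal percentages can still differ in absolute numbers when spleen size differs, which is presumably why both quantities are shown.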
Allogeneic IgG combined with dendritic cell stimuli 
induce antitumour T-cell immunity 


Yaron Carmi, Matthew H. Spitzer, Ian L. Linde, Bryan M. Burt, Tyler R. Prestwood, Nicola Perlman, Matthew G. Davidson, 
Justin A. Kenkel, Ehud Segal, Ganesh V. Pusapati, Nupur Bhattacharya & Edgar G. Engleman 


Whereas cancers grow within host tissues and evade host immunity 
through immune-editing and immunosuppression’~*, tumours are 
rarely transmissible between individuals. Much like transplanted 
allogeneic organs, allogeneic tumours are reliably rejected by host 
T cells, even when the tumour and host share the same major 
histocompatibility complex alleles, the most potent determinants 
of transplant rejection® *°. How such tumour-eradicating immun- 
ity is initiated remains unknown, although elucidating this process 
could provide the basis for inducing similar responses against nat- 
urally arising tumours. Here we find that allogeneic tumour rejec- 
tion is initiated in mice by naturally occurring tumour-binding 
IgG antibodies, which enable dendritic cells (DCs) to internalize 
tumour antigens and subsequently activate tumour-reactive T 
cells. We exploited this mechanism to treat autologous and autoch- 
thonous tumours successfully. Either systemic administration of 
DCs loaded with allogeneic-IgG-coated tumour cells or intratu- 
moral injection of allogeneic IgG in combination with DC stimuli 
induced potent T-cell-mediated antitumour immune responses, 
resulting in tumour eradication in mouse models of melanoma, 
pancreas, lung and breast cancer. Moreover, this strategy led to 
eradication of distant tumours and metastases, as well as the 
injected primary tumours. To assess the clinical relevance of these 
findings, we studied antibodies and cells from patients with lung 
cancer. T cells from these patients responded vigorously to auto- 
logous tumour antigens after culture with allogeneic-IgG-loaded 
DCs, recapitulating our findings in mice. These results reveal that 
tumour-binding allogeneic IgG can induce powerful antitumour 
immunity that can be exploited for cancer immunotherapy. 

To study the basis of allogeneic tumour rejection, we examined 
the immune response to tumours in major histocompatibility 
complex (MHC)-matched allogeneic mice (illustrated in Fig. 1a). 
B16 melanoma cells expanded continuously in syngeneic C57BL/6 
hosts yet spontaneously regressed in allogeneic 129S1 hosts 
(Fig. 1b). Conversely, liver-metastatic pancreatic tumour cells 
(LMP), isolated from KrasG12D/+;LSL-Trp53R172H/+;Pdx1-Cre 
mice¹¹, grew steadily in 129S1 mice but spontaneously regressed 
in C57BL/6 animals (Fig. 1b). Depletion of natural killer (NK) cells 
did not prevent tumour rejection (Extended Data Fig. 1a). In con- 
trast, depletion of CD4⁺ or CD8⁺ T cells before allogeneic tumour 
inoculation prevented tumour regression (Fig. 1b). T-cell prolif- 
eration and tumour infiltration began by week 1 (Fig. 1c and 
Extended Data Fig. 1b). Additionally, allogeneic tumours con- 
tained more mature myeloid DCs (mDCs, Ly6C⁻CD11b⁺CD11c⁺ 
MHCII⁺CD64dull) and fewer SSClowCD11bhiLy6ChiMHCII⁻ myeloid 
cells than syngeneic tumours (Fig. 1d and Extended Data Fig. 1c). 
Even at day 3, mDCs in allogeneic tumours expressed higher levels of 
MHCII, CD86 and CD40 compared to mDCs in syngeneic tumours, 
reflecting activation (Extended Data Fig. 1d). Allogeneic mDCs inter- 
nalized more tumour-cell-derived molecules from CFSE-labelled LMP 
cells than syngeneic mDCs (Fig. 1e). However, co-culture of DCs with 
allogeneic tumour cells induced negligible activation or tumour antigen 
uptake (Fig. 1f and Extended Data Fig. 1e), demonstrating that additional 
factors contribute to DC activation in vivo. 

Notably, IgM and IgG antibodies were bound to allogeneic, but not 
syngeneic, tumour cells within 24 h after tumour inoculation (Fig. 1g-i), 
before T cells appeared (Fig. 1c). Moreover, allogeneic antibodies bound 
tumour cells more effectively than syngeneic antibodies (Extended Data 
Fig. 2a), including syngeneic antibodies from tumour-bearing mice 
(Extended Data Fig. 2b). To assess the potential role of antibodies in 
tumour rejection, B cells were depleted before mice were challenged 
with allogeneic tumours (Extended Data Fig. 2c). Antibody depletion 
accelerated tumour development and delayed or prevented tumour 
rejection (Fig. 1j). Moreover, adoptive transfer of allogeneic IgG, but 
not IgM, enabled rejection of syngeneic tumours (Fig. 1k and Extended 
Data Fig. 2d). This effect was abrogated in Fcγ receptor (FcγR)-deficient 
mice (Fig. 1k). 

To investigate the effect of antibodies on tumour uptake by DCs, we 
incubated tumour cells or lysates with syngeneic or allogeneic antibodies 
to form immune complexes and added these to bone-marrow-derived 
DCs (BMDCs) (Fig. 2a). Only immune complexes from allogeneic IgG 
(alloIgG-IC) or IgM (alloIgM-IC) induced BMDC activation and uptake 
of tumour-derived proteins (Fig. 2b-d), which were found in proximity 
to MHCII molecules (Fig. 2e). BMDCs activated by alloIgG-IC induced 
significant T-cell proliferation (Fig. 2f), demonstrating that tumour 
antigens were processed and presented. 

To determine whether immune-complex-bound DCs could elicit 
antitumour immune responses in syngeneic hosts, B16 or LMP cells 
were inoculated subcutaneously, and tumours were removed upon 
reaching 25-55 mm², leaving tumour-free margins. IgG-IC or IgM-IC 
were prepared from excised tumours and incubated with syngeneic 
BMDCs, which were injected into the corresponding tumour-resected 
mouse (Fig. 2g). While nearly all mice treated with syngeneic BMDCs 
loaded with alloIgG-IC remained tumour-free for over a year, all other 
animals experienced rapid tumour relapse (Fig. 2h). This response was 
completely abrogated in DCs lacking FcγR (Extended Data Fig. 3a-c). 
Furthermore, adoptive transfer of T cells from alloIgG-IC-treated 
animals protected naive mice from tumour challenge (Extended Data 
Fig. 3d, e). 

Despite these findings, only minor effects were observed when allo- 
geneic IgG was injected into tumours in autologous hosts (Fig. 3a). To 
address this discrepancy, we obtained tumour-associated DCs 
(TADCs) (Extended Data Fig. 4a) and cultured them with tumour 
lysates or alloIgG-IC. In contrast to BMDCs, TADCs displayed no 
activation (Fig. 3b-d and Extended Data Fig. 4b) and their transfer 
to tumour-resected mice had no effect on recurrence (Fig. 3e). 
Accordingly, p38, ERK1/2 and JNK were phosphorylated in BMDCs 
but not TADCs activated with alloIgG-IC (Fig. 3f). We then tested the 
effect of additional MAPK stimuli on the response of TADCs to 
Figure 1 | Tumour-binding antibodies initiate rejection of allogeneic 
tumours. a, Experimental design: injection of LMP and B16 cells 
subcutaneously into syngeneic and allogeneic hosts. b, Growth of LMP and B16 
tumours in C57BL/6, 129S1, CD4⁺ cell-depleted or CD8⁺ cell-depleted 
allogeneic mice (n = 6, 3 independent experiments). c, Percentages of LMP- 
infiltrating CD4⁺ and CD8⁺ T cells among CD45⁺ cells (n = 5, 3 independent 
experiments). d, Percentages of LMP-infiltrating CD11bhiLy6Chi cells and 
mature DCs among total cells (n = 4, 3 independent experiments). e, Myeloid 
cells in the draining lymph nodes of mice inoculated with CFSE-labelled LMP 
cells 3 days earlier (n = 5, 3 independent experiments). R1, macrophages; R2, 
myeloid dendritic cells; R3, classical dendritic cells. f, Tumour uptake, MHCII 
and CD86 expression by BMDCs and blood monocyte-derived (Mo) DCs 
incubated overnight with CFSE-labelled live, frozen/thawed (necrotic), or 


alloIgG-IC. Poly(I:C), TNFα + CD40L or IFNγ + CD40L enabled 
activation of TADCs and alloIgG-IC uptake (Fig. 3g and Extended 
Data Fig. 4c, d). We subsequently tested whether allogeneic IgG in 
combination with one of these stimuli could induce immune responses to 
syngeneic tumours in situ. Intratumoral injection of allogeneic IgG com- 
bined with TNFα + CD40L or poly(I:C) induced complete elimination 
of B16 and LL/2 tumours (Fig. 4a and Extended Data Fig. 5a-c). 

Under these conditions, only mDCs (CD11b⁺Ly6C⁻CD11c⁺ 
MHCII⁺CD64dull) and cDCs (CD11b⁻CD11c⁺MHCII⁺) markedly 
increased their IgG binding during an effective antitumour immune 
response (Fig. 4b and Extended Data Fig. 5d). Moreover, tumour-infilt- 
rating DCs exhibited significant activation (Fig. 4c) and accumulation in 
the draining lymph nodes (Extended Data Fig. 5e). Adoptive transfer of 
TADCs from treated mice into naive mice conferred complete protec- 
tion against B16 (Fig. 4d). In contrast, transfer of macrophages had a 
modest protective effect, while B cells, NK cells and mast cells provided 
no benefit (Extended Data Fig. 5f, g). 
mitomycin-C-treated (apoptotic) LMP cells or fluorescein-labelled Escherichia 
coli BioParticles (n = 4, 4 independent experiments). MFI, mean fluorescence 
intensity. g, IgG and IgM bound in vivo to CFSE-labelled LMP cells 48 h after 
tumour inoculation (n = 5, 4 independent experiments). h, i, Representative 
staining of tumour sections by IgM and IgG 24 h after inoculation of CFSE- 
labelled LMP cells. Original magnification, 200×; 3 independent experiments. 
j, Tumour size in 129S1, C57BL/6 and B-cell-depleted allogeneic hosts (n = 5, 3 
independent experiments). k, B16 size in naive mice or mice injected with 
syngeneic or allogeneic antibodies (n = 5). B16 size in naive C57BL/6 and FcγR 
knockout (KO) mice injected with allogeneic antibodies (n = 5, 3 independent 
experiments). Experiments were independently repeated at least 3 times and 
analysed by Mann-Whitney U test. *P < 0.05; **P < 0.01; NS, not significant. 
Error bars represent s.e.m. unless specified otherwise. 
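The captions state that group comparisons were analysed by the Mann-Whitney U test. For readers unfamiliar with it, the rank-based statistic underlying that test can be sketched as follows. This is a minimal pure-Python illustration under stated assumptions: it computes only the U statistics (with average ranks for ties), not the P value, and the function name and sample values are hypothetical rather than taken from the paper.

```python
def mann_whitney_u(group_a, group_b):
    """Return (U_a, U_b), the Mann-Whitney U statistics for two samples.

    Tied observations receive the average of the ranks they span.
    """
    if not group_a or not group_b:
        raise ValueError("both groups need at least one observation")

    # Pool the observations, remembering which group each came from.
    pooled = sorted([(v, 0) for v in group_a] + [(v, 1) for v in group_b])

    # Assign 1-based ranks, averaging over runs of equal values.
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        average_rank = (i + 1 + j) / 2.0  # mean of ranks i+1 .. j
        for k in range(i, j):
            ranks[k] = average_rank
        i = j

    rank_sum_a = sum(r for r, (_, g) in zip(ranks, pooled) if g == 0)
    n_a, n_b = len(group_a), len(group_b)
    u_a = rank_sum_a - n_a * (n_a + 1) / 2.0
    u_b = n_a * n_b - u_a
    return u_a, u_b


# Hypothetical tumour sizes for two groups, for illustration only.
u_treated, u_control = mann_whitney_u([1, 2, 3], [4, 5, 6])
```

A U value of 0 for one group means every observation in that group ranks below every observation in the other. Converting U to a P value requires the exact distribution or a normal approximation, which a statistics package would normally supply.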


To test whether allogeneic IgG bears unique modifications that 
mediate an immune response, we covalently crosslinked syngeneic 
IgG onto B16 membrane proteins. These immune complexes still 
conferred a therapeutic benefit after incubation with BMDCs 
(Extended Data Fig. 6a), demonstrating that binding of IgG to the 
tumour cell surface, rather than the origin of the IgG, was critical. 
To investigate whether the tumour-binding antibody targets are 
related to the antitumour T-cell specificities, we resected B16 
tumour cells and formed immune complexes using an antibody 
against MHC-I, against which there could not be reactive T cells. 
DCs loaded with these immune complexes protected mice from B16 
recurrence without inducing autoimmunity, suggesting that 
tumour-reactive T-cell specificity is not determined by the antibody 
targets (Extended Data Fig. 6b). Furthermore, B16-bearing mice 
treated with allogeneic IgG + anti-CD40 + TNFα were protected 
from re-challenge with B16 melanoma, but not syngeneic RMA 
lymphoma, suggesting that the tumour-reactive T cells recognize 
Figure 2 | AlloIgG-IC are internalized and presented by BMDCs and drive 
protective immunity in vivo. a, Experimental design: tumour cells or lysates 
were incubated with syngeneic or allogeneic antibodies and then cultured with 
BMDCs overnight. b, Expression of CD86 and MHCII on BMDCs cultured 
with antibody-coated tumour lysates or intact tumour cells (n = 5, 10 
independent experiments). IC, immune complexes. c, TNFα and IL-12 in 
supernatants of BMDCs cultured overnight with immunoglobulin immune 
complexes formed with LMP lysate or intact LMP cells (n = 5, 4 independent 
experiments). d, Internalization of CFSE in BMDCs incubated overnight with 
immunoglobulin immune complexes formed from CFSE-labelled tumour 
lysates or CFSE-labelled intact cells (n = 4, 10 independent experiments). 

e, Representative localization of MHCII and immunoglobulin immune 


tumour-associated antigens rather than widely expressed allo-anti- 
gens (Extended Data Fig. 6c). 

Vaccination with BMDCs loaded with immune complexes contain- 
ing B16 proteins derived from the cell membrane, but not other sub- 
cellular fractions, prevented tumour relapse, and B16 protein 
denaturation, but not deglycosylation, removed the therapeutic benefit 
(Extended Data Fig. 6d). Pre-absorbing allogeneic IgG against normal 
cells syngeneic to the tumour also removed the therapeutic benefit 
(Extended Data Fig. 6e). Allogeneic IgG from germ-free mice induced 
tumour immunity (Extended Data Fig. 6f), suggesting that IgG against 
microbiota was not required. Therefore, the protective effect of allo- 
geneic IgG is dependent on antibody binding to membrane proteins 
expressed on normal cells. 

We therefore identified B16 membrane proteins specifically bound 
by allogeneic IgG using mass spectrometry. While syngeneic IgG 
bound six cell membrane proteins, all at approximately equal or lower 
complexes on BMDCs cultured overnight with CFSE-labelled LMP cells coated 
with allogeneic antibodies (original magnification, 400×; 3 independent 
experiments). f, Proliferation of CD4⁺ T cells cultured with DCs loaded with 
immune complexes formed from LMP and B16 lysates or intact cells (n = 5, 5 
independent experiments). c.p.m., counts per minute. g, Experimental design: 
tumours were removed from mice, coated with antibodies, incubated for 24 h 
with BMDCs, and injected subcutaneously into corresponding tumour- 
resected mice. h, Tumour recurrence in mice treated with BMDCs loaded with 
tumour lysate incubated with allogeneic or syngeneic antibodies (n = 5, 3 
independent experiments). Experiments were independently repeated at least 3 
times and analysed by Mann-Whitney U test. *P < 0.05; **P < 0.01. Error bars 
represent s.e.m. unless specified otherwise. 


levels than allogeneic IgG, allogeneic IgG preferentially bound sixteen 
cell membrane proteins, many containing strain-specific polymorph- 
isms (Extended Data Table 1). To validate these hits functionally, we 
focused on transmembrane-glycoprotein NMB (GPNMB). Antibodies 
against GPNMB bound B16 cells at much higher levels than normal 
cells and enabled DC activation, and allogeneic IgG bound GPNMB at 
higher levels than syngeneic IgG (Extended Data Fig. 7a-c). Treatment 
using anti-GPNMB + anti-CD40 + TNFα induced significant FcγR- 
dependent tumour regression (Extended Data Fig. 7d, e). Treated 
tumours exhibited marked leukocyte infiltration, including activated 
effector/memory T cells, compared to untreated tumours (Extended 
Data Fig. 8a, b). Whereas all treatments elicited gp100-reactive CD8⁺ 
T cells, only allogeneic IgG + anti-CD40 + TNFα elicited Trp2-react- 
ive CD8⁺ T cells (Extended Data Fig. 8c). Adoptive transfer of CD4 or 
CD8 T cells from these mice protected naive mice from B16 challenge, 
and depletion of either CD4 or CD8 T cells before treatment prevented 
Figure 3 | TADCs, but not BMDCs, require stimulation to respond to 
alloIgG-IC. a, Tumour growth following intratumoral injection of PBS, 129S1 
IgG or C57BL/6 IgG (n = 6, 3 independent experiments). b, CD86 and MHCII 
expression on DCs incubated with PBS, tumour lysates or alloIgG-IC (n = 5, 
10 independent experiments). c, TNFα and IL-12 in the supernatants of DCs 
cultured with PBS control, LMP lysate or alloIgG-IC (n = 5, 4 independent 
experiments). d, Proliferation of CD4⁺ T cells cultured with DCs treated with 
PBS, tumour lysate, or alloIgG-IC (n = 5, 5 independent experiments). 

e, Recurrence of resected LMP and B16 in untreated mice or mice treated with 


tumour regression (Extended Data Fig. 8d, e). These findings confirm 
that allogeneic IgG induces T-cell reactivity against tumour-associated 
antigens distinct from those bound by the antibodies. 

We next treated a genetically engineered melanoma model driven 
by BrafV600E and loss of Pten (ref. 12) with allogeneic IgG + anti-CD40 
+ TNFα. Treated mice experienced complete responses lasting over 8 
weeks in the injected tumours and distant sites (Fig. 4e). To assess the 
effect of this combination on metastases, orthotopic 4T1 breast 
tumours were treated after all mice had palpable tumour-draining 
lymph nodes, indicative of tumour spread. Only treatment with allo- 
geneic IgG + anti-CD40 + TNFα led to almost complete resolution of 
metastases and primary tumours, and the few remaining micrometas- 
tases were heavily infiltrated with leukocytes (Fig. 4f, g and Extended 
Data Fig. 9a). 

We next compared the capacity of IgG from cancer patients and 
healthy allogeneic donors to bind the patients’ tumours. Most but not 
all donors had antibodies with higher tumour-binding capacity 
(Extended Data Fig. 9b). We tested whether allogeneic IgG + 
CD40L + TNFα could induce tumour uptake and maturation of 
human TADCs from two patients with lung carcinoma. Addition of 
CD40L + TNFα enabled these DCs to internalize alloIgG-IC and 
induced DC activation (Fig. 4h and Extended Data Fig. 9c, d). 
Moreover, BMDCs from two patients with malignant pleural mesothe- 
alloIgG-IC-activated BMDCs or TADCs (n = 5, 3 independent experiments). 

f, Phosphorylated (p)-p38, pERK1/2 and pJNK levels in DCs, untreated or 
incubated with alloIgG-IC. Graphs show arcsinh ratios of phospho-species in DCs 
incubated for 5 min with LMP lysate or alloIgG-IC over baseline levels from 
unstimulated DCs (n = 5, 5 independent experiments). g, MHCII and CD86 
expression and CFSE internalization by TADCs after overnight culture with 
CFSE-labelled alloIgG-IC (n = 4, 10 independent experiments). Experiments 
were independently repeated at least 3 times and analysed by Mann-Whitney U 
test. *P < 0.05; **P < 0.01. Error bars represent s.e.m. unless specified otherwise. 
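Panel f reports phospho-protein signals as arcsinh ratios over the unstimulated baseline. One common way to compute such a value, sketched here as an assumption and not as the authors' stated procedure, is the difference between the arcsinh-transformed stimulated and baseline intensities after division by a cofactor. The cofactor of 5 and the function name `arcsinh_ratio` are assumptions for illustration; the caption does not specify them.

```python
import math


def arcsinh_ratio(stimulated: float, baseline: float, cofactor: float = 5.0) -> float:
    """Difference of arcsinh-transformed intensities (stimulated minus baseline).

    cofactor is an assumed scaling constant; the source caption does not
    state which value, if any, was used.
    """
    if cofactor <= 0:
        raise ValueError("cofactor must be positive")
    return math.asinh(stimulated / cofactor) - math.asinh(baseline / cofactor)


# Hypothetical median intensities, for illustration only.
no_change = arcsinh_ratio(10.0, 10.0)   # equal signals give a value of zero
induced = arcsinh_ratio(50.0, 5.0)      # stimulated signal above baseline
```

The arcsinh transform behaves linearly near zero and logarithmically for large values, so it tolerates low or zero counts that a plain log fold change cannot handle.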


lioma incubated with alloIgG-IC, but not autologous IgG-IC, exhibited 
activation and drove autologous CD4⁺ T-cell proliferation (Fig. 4i). 

The effect of naturally arising tumour-reactive antibodies on 
tumour progression has been a source of controversy. Some studies 
suggest that such antibodies promote tumour progression’*”’, 
while others report that they can stimulate antitumour immun- 
ity?’ **. Like the antibodies that develop in cancer patients, com- 
mercial immunoglobulin preparations, which probably contain 
tumour-binding alloantibodies, have shown limited benefit when 
used to treat cancer”’’. Our data may provide a mechanistic 
explanation for these findings, as they show that while TADCs 
are not naturally responsive to IgG-IC, addition of specific stimuli 
enables them to drive tumour-eradicating immunity. Hence, the 
role that tumour-binding antibodies have in tumour immunity 
depends upon the environmental context and the cell types 
involved. 

Here we demonstrate that tumour-antigen presentation after 
antibody-mediated uptake by DCs is sufficient to initiate protect- 
ive T-cell-mediated immunity against tumours. Our work suggests 
that this fundamental mechanism of immunological recognition 
and targeting, which prevents tumour transmission even between 
MHC-matched individuals, can be exploited as a powerful thera- 
peutic strategy for cancer. 
Figure 4 | Injection of tumours in situ with alloantibodies in combination 
with CD40 agonists and TNFα induces systemic DC-mediated antitumour 
immunity. a, Growth of tumours injected with allogeneic IgG (alloIgG), with or 
without immune stimuli (n = 6, 3 independent experiments). b, Mean 
fluorescence of phycoerythrin in myeloid cells from B16-bearing mice 2 h after 
treatment (n = 4, 3 independent experiments). c, CD86 and MHCII expression on 
DCs from B16 tumours 5 days after treatment (n = 6, 3 independent 
experiments). d, B16 growth in mice vaccinated with 2 x 10° DCs transferred 
from treated or untreated B16 tumours (n = 6, 3 independent experiments). 

e, Tumour number in Tyr:CreER;Braf“""/Pten"™”"™ mice following treatment 
(n = 4, 3 independent experiments). Photographs show representative mice on 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 

Mice. 129S1/SvImJ mice, C57BL/6 wild-type (WT) mice, Balb/c mice, and mice 
that develop inducible melanoma (B6.Cg-Braftm1Mmcm Ptentm1Hwu 
Tg(Tyr-cre/ERT2)13Bos/BosJ) were purchased from the Jackson Laboratory (Bar Harbor, 
Maine) and bred on-site. CD-1 outbred mice and FcγR−/− (B6.129P2- 
Fcer1gtm1Rav) mice were purchased from Taconic (Germantown, NY). 12-16- 
week-old male and female mice were sorted randomly into groups before assigning 
treatment conditions. No blinded experiments were conducted. All mice were 
maintained in an American Association for the Accreditation of Laboratory 
Animal Care-accredited animal facility. All protocols were approved by the 
Stanford University Institutional Animal Care and Use Committee under protocol 
APLAC-17466. 

Cell lines. The mouse lines B16F10 (melanoma), 4T-1.1 (breast cancer), LL/2 
(Lewis lung carcinoma) and RMA (lymphoma) were all purchased from the 
ATCC. LMP pancreas tumour cells were isolated from Kras^G12D/+; LSL-
Trp53^R172H/+; Pdx-1-Cre mice in our laboratory as described¹¹. Cells were cultured
in DMEM (Gibco, Carlsbad, California) supplemented with 10% heat-inactivated
FCS, 2 mM L-glutamine, 100 U ml⁻¹ penicillin and 100 µg ml⁻¹
streptomycin (Gibco) under standard conditions. Cell lines were tested for myco- 
plasma contamination and endotoxin. 

Preparation and in vitro studies of mouse DC subsets. Bone marrow mono- 
nuclear cells were negatively selected using a murine monocyte enrichment kit (Stem 
Cell Technologies, Vancouver, Canada), and FSC’ssc!’Ly6Ch'cD115"MHCI 
cells were sorted with a FACS Aria II (BD Biosciences). Monocytes were cultured 
for 4-5 days in the presence of 50 ng ml⁻¹ GM-CSF (PeproTech) to generate DCs.
For TADCs, tumours were digested in Hank’s balanced salt solution (HBSS, Gibco)
containing 4 mg ml⁻¹ collagenase IV and 0.01 mg ml⁻¹ DNase I (Sigma). Cells were
applied on a Ficoll gradient and magnetically enriched using CD11b⁺ selection kits
(StemCells) and Ly6C⁻CD11c⁺MHCII⁺ cells were sorted by FACS. In some experiments
TADCs were activated with 1 µg ml⁻¹ bacterial lipopolysaccharide (LPS),
1 µg ml⁻¹ high molecular mass polyinosinic-polycytidylic acid (poly(I:C)) (both
from InvivoGen, San Diego, California), or with 50 ng ml⁻¹ TNFα or 50 ng ml⁻¹
IFNγ (PeproTech) in combination with 500 ng ml⁻¹ CD40L, OX-40 (PeproTech) or
500 ng ml⁻¹ CD28 (R&D) recombinant mouse proteins. All in vitro activations of
mouse DCs were independently repeated at least 10 times in duplicate. 
Preparation and in vitro studies of tumour cells, TADCs, autologous T cells 
and IgG from patients with cancer. Tumour cells, TADCs, peripheral blood T 
cells and IgG were obtained from two patients undergoing resection surgery for 
stage I lung carcinoma. Tumours were enzymatically digested with 0.1 mg ml⁻¹ of
DNase I and 5 mg ml⁻¹ collagenase IV (Sigma) in HBSS for 30 min. Tumour cells
were enriched by sorting CD45-negative cells, fixed in 2% paraformaldehyde for 
20 min, washed extensively in PBS and coated for 30 min with autologous IgG or 
pooled allogeneic IgG obtained from healthy blood donors. To obtain TADCs, 
FSC’’SsC°"CD11¢* MHCII™ cells were sorted and maintained for 1 h in 10% 
FCS IMDM at 37 °C. For FACS and confocal studies, tumour DCs were incubated 
overnight with autologous tumour cells coated with self IgG or alloIgG alone, or in
the presence of 5 ng ml⁻¹ recombinant human TNFα and 500 ng ml⁻¹ CD40L
(PeproTech). 

In separate experiments, 10-cm-long rib bones and 10 ml blood were obtained 
from two patients undergoing resection surgery for malignant pleural mesothe- 
lioma. To generate BMDCs, bones were flushed with PBS and mononuclear cells 
were separated on Ficoll gradients. CD34⁺ cells were then enriched using magnetic
beads (Miltenyi) and cultured for 9-12 days in IMDM (Gibco) supplemented
with 10% FCS, 50 ng ml⁻¹ human GM-CSF and 20 ng ml⁻¹ human IL-4
(PeproTech). To obtain autologous tumour cells, tumours were enzymatically
digested with 0.1 mg ml⁻¹ of DNase I and 5 mg ml⁻¹ collagenase IV (Sigma) in
HBSS for 30 min. Tumour cells were enriched by sorting CD45-negative cells, 
fixed in 2% paraformaldehyde for 20 min, washed extensively in PBS and coated 
for 30 min with autologous or pooled allogeneic IgG. Autologous CD4⁺ T cells
were enriched from peripheral blood mononuclear cells on magnetic beads 
(Miltenyi) and IgG was isolated from each patient’s plasma using protein A 
columns (GE Healthcare). For T-cell proliferation assays, 2 × 10⁴ DCs were
incubated overnight with antibody-coated tumour cells as above, washed and
co-cultured with 2 × 10° autologous CD4⁺-enriched T cells. After 6 days, cells
were pulsed with [³H]thymidine (1 µCi per well) and cultured for an additional
18 h before being harvested in a Harvester 400 (Tomtec). Radioactivity was
measured by a 1450 MicroBeta counter (LKB Wallac). T-cell proliferation was
assayed in six technical replicates per sample. The human subjects protocols
were approved by Stanford’s Institutional Review Board, and informed consent 
was obtained from all subjects. 

Flow cytometry. For cell surface staining, monoclonal antibodies conjugated to 
FITC, PE, PE-Cy7, PE-Cy5.5, APC-Cy7, eFluor 650, or Pacific blue and specific for 
the following antigens were used: CD11b (M1/70), F4/80 (BM8), B220 (RA3-6B2) 
from BioLegend (San Diego, California) and CD115 (AFS98), CD80 (16-10A1),
I-Ab (AF6-120.1), CD40 (1C10), Ly6C (HK1.4), CD86 (GL1) from eBioscience 
(San Diego, California). All in vivo experiments to characterize tumour-infiltrating 
leukocytes were independently repeated at least three times with 3-5 mice per 
group. iTAg APC-labelled H-2Kᵇ-Trp-2(SVYDFFVWL) and iTAg PE-labelled
H-2Dᵇ-gp100(EGSRNQDWL) tetramers were purchased from MBL International
(Woburn, Massachusetts) and were used according to the manufacturer’s instructions.
Tetramer-staining experiments were repeated twice with 5 mice in each
group. For protein phosphorylation-specific flow cytometry, cells were activated 
for 5, 15 or 30 min with or without IC and fixed for 15 min with 1.8% para- 
formaldehyde. Cells were washed twice with PBS containing 2% FCS and incu- 
bated with 95% methanol at 4 °C for 20 min. Conjugated antibodies against 
phospho-p38 (Thr180/Tyr182) and phospho-JNK (Thr183/Tyr185) were pur- 
chased from Cell Signaling and phospho-ERK1/2 (p44) (pT202/pY204) from 
BD Biosciences. DC protein phosphorylation experiments were repeated five 
times, each with biological duplicates. For tumour-binding IgM and IgG, PE- 
conjugated anti-mouse IgM (RMM-1), anti-mouse IgG (Poli4052) and anti- 
human IgG (HP6017) were purchased from BioLegend. Flow cytometry was 
performed on a LSRII (BD Biosciences) and data sets were analysed using 
FlowJo software (Tree Star, Inc.). In vivo binding levels were tested in four inde- 
pendent experiments, 3-5 mice in each group. 

Intracellular IFNγ staining. B16 tumours from treated mice were digested to
obtain a single cell suspension. A total of 2 × 10° cells per well were cultured for 4 h
in 10% FCS RPMI containing 1× Brefeldin A (eBioscience) in a 96-well plate
containing 4 × 10⁴ BMDCs loaded with 10 µg of B16 membrane proteins. Cells
were washed and stained for extracellular T-cell markers. Cells were then fixed and
permeabilized using Cytofix/Cytoperm solutions (BD Bioscience) and stained with
PE-Cy7 conjugated anti-IFNγ antibody (XMG1.2, BioLegend). Experiments were
repeated twice independently with 5 mice per group.

Cytokine measurements. Cells were seeded at 1 × 10° cells ml⁻¹ and cultured for
12 h with or without tumour immune complexes, or LPS (Sigma). TNFα, IFNγ
and IL-12 (p40/p70) in the supernatants were measured by ELISA, according to
manufacturer’s instructions (R&D Systems, Minneapolis, Minnesota). Cytokine
secretion was measured in biological triplicates in four independent experiments.

IgG and IgM purification and measurement. Mouse antibodies were obtained
from pooled 5-ml 20-24-week-old mouse serum by liquid chromatography on 
AKTA Explorer/100Air (GE Healthcare). Total mouse IgG and IgM were purified 
using protein-G and 2-mercaptopyridine columns, respectively (GE Healthcare). 
The levels of purified IgG and IgM were measured with specific ELISA kits (Bethyl, 
Montgomery, Texas) according to manufacturer’s instructions. The capacity of 
purified antibodies to bind tumour cells was tested by flow cytometry before their 
use in vivo. IgG at 1 µg per 1 × 10° allogeneic tumour cells bound at least 8 times
more strongly than isotype control antibodies. Serum levels of antibodies were
measured in biological triplicates in four independent experiments. 

Necrotic and apoptotic tumour cell internalization experiments. For necrotic 
tumour cells, cultured LMP or B16 cells were trypsinized, washed and resuspended
at a concentration of 5 × 10° cells ml⁻¹ in cold PBS (Gibco). Cells were
then subjected to three cycles of freeze-thaw between liquid nitrogen and a 37 °C
water bath and the level of necrotic cells was determined by Trypan blue under
light microscopy. Apoptotic tumour cells were prepared by pre-incubation
with 25 µg ml⁻¹ of mitomycin C (Sigma) for 1 h in antibiotic- and serum-free
DMEM. Fluorescein-labelled E. coli BioParticles were purchased from Life
Technologies and used according to the manufacturer’s instructions. Dendritic
cell activations with the above cells were repeated in four independent experiments,
each in biological duplicates.

Preparation of antibody-tumour lysate immune complexes and antibody- 
bound tumour cells. When obtained from surgical resections, tumour cells were 
initially isolated after enzymatic digestion and sorted as FSCʰⁱCD45⁻ cells before
their fixation and staining. For tumour-antibody complexes, tumour cells were 
fixed in 2% paraformaldehyde, washed extensively and incubated with 1-3 µg
syngeneic or allogeneic IgG or IgM per 1 × 10° tumour cells, and were then
washed to remove excess antibodies. To obtain tumour lysate immunoglobulin
immune complexes, tumour cells were incubated for 30 min on ice with 1-3 µg
syngeneic or allogeneic IgG or IgM per 1 × 10° tumour cells, washed to remove
excess antibodies and further disrupted with non-denaturing lysis buffer (Pierce) to
obtain immunoglobulin immune complexes. Dendritic cell activations with the
above immunoglobulin immune complexes were repeated in at least 10 independent
experiments in biological duplicates.

Absorption of allogeneic IgG on normal cells. Skin and pancreas were removed 
from naive C57BL/6 or 129S1 mice and enzymatically digested with 0.1 mg ml⁻¹
of DNase I (Sigma) and 4 mg ml⁻¹ collagenase IV (Sigma) in PBS to obtain single
cell suspensions. Splenocytes were isolated by mashing spleens through 70 µm
cell strainers. Cells were then mixed at 1:1:1 ratio and extensively washed and
incubated with 0.5 µg per 1 × 10° cells FcγR block (BD) and 5% (w/v) BSA
(Sigma) in PBS for 15 min on ice. Cells were then washed and incubated with
allogeneic IgG (2 µg per 1 × 10° cells) for 30 min on ice. Cells were centrifuged at
5,000 r.p.m. for 10 min, and the supernatants were concentrated by 50 kDa
centrifugal filters (Amicon) before being incubated with 1 × 10° tumour cells.
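
The absorption step above gives its spin speed in r.p.m., whereas the other centrifugation steps in these Methods are given as relative centrifugal force (g). The two are related by the standard formula RCF = 1.118 × 10⁻⁵ × r × N², with r the rotor radius in cm and N the speed in r.p.m. The sketch below applies that formula. The rotor radius is not stated in the text, so the 8 cm value and the function names are illustrative assumptions only.

```python
def rcf_from_rpm(rpm: float, radius_cm: float) -> float:
    """Relative centrifugal force (x g) for a given speed and rotor radius."""
    return 1.118e-5 * radius_cm * rpm ** 2


def rpm_from_rcf(rcf: float, radius_cm: float) -> float:
    """Inverse conversion: speed in r.p.m. needed to reach a given RCF."""
    return (rcf / (1.118e-5 * radius_cm)) ** 0.5


# Hypothetical rotor radius; the Methods do not report one.
ASSUMED_RADIUS_CM = 8.0

if __name__ == "__main__":
    g_force = rcf_from_rpm(5000, ASSUMED_RADIUS_CM)
    print(f"5,000 r.p.m. at {ASSUMED_RADIUS_CM} cm is about {g_force:.0f} x g")
```

Because the result scales linearly with the radius, the same 5,000 r.p.m. spin gives a different g value on every rotor. Reporting g rather than r.p.m. makes a step reproducible across instruments.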

Membrane protein extraction. For native membrane protein extraction, B16F10
cells were scraped in cold PBS and pelleted at 400g for 5 min at 4 °C. The cell pellet 
was washed twice in cold PBS, resuspended in 10 mM HEPES pH 7.4 and incu- 
bated on ice for 10 min. Cells were pelleted and the buffer was removed. The cell 
pellet was resuspended in 10 ml of SEAT buffer (10 mM triethanolamine/acetic 
acid, 1 mM EDTA pH 8.0, 250 mM sucrose, protease inhibitor cocktail) and 
homogenized with 20 strokes of a dounce homogenizer. The sample was spun 
at 900g for 6 min to collect the post-nuclear supernatant (PNS). The PNS was spun 
at 100,000g for 60 min at 4 °C to harvest a membrane pellet, which was then 
resuspended in 4 ml membrane extraction buffer (MEB) containing 50 mM Tris- 
HCl pH 8.0, 150 mM NaCl, 1% NP-40, 1 mM DTT, 10% glycerol, 1 mM NaF, and 
protease inhibitor cocktail. After incubation for 2 h at 4 °C, the membrane extract 
was clarified by centrifugation at 100,000g for 30 min at 4 °C. For denatured 
membrane protein extraction, the membrane pellet was resuspended in 500 µl
Radio-Immuno-Precipitation Assay buffer (RIPA, Sigma) and lysed with a 25G 
needle syringe. Lysates were incubated at 4 °C for 1 h and spun at 100,000g, 30 min, 
4 °C. Supernatant containing detergent solubilized membrane proteins was col- 
lected and boiled for 5 min at 95 °C. Deglycosylation of membrane proteins was 
performed using a commercial kit (New England Biolabs, Ipswich, Massachusetts) 
according to the manufacturer’s instructions. Isolation of cell-membrane proteins 
was repeated three independent times and the running pattern of precipitated 
proteins was compared on SDS-PAGE. 

Immunoprecipitation and mass spectrometry. Immunoprecipitation was set up 
with 20 mg membrane extract and 50 µg of syngeneic or allogeneic IgG coupled to
protein G magnetic beads and incubated for 16 h at 4 °C. Beads were washed three times
with MEB and bound protein complexes were eluted with 2× Laemmli buffer. The
eluted sample was subjected to SDS-PAGE on a 4-12% Bis-Tris gel followed by 
GelCode blue staining (Thermo Scientific) to visualize protein bands. Protein 
bands were excised, digested with trypsin and analysed (MS Bioworks) using a 
nano LC/MS/MS with a NanoAcquity HPLC system (Waters) interfaced to a Q 
Exactive (Thermo Fisher). The mass spectrometer was operated in data-depend- 
ent mode, with MS and MS/MS performed in the Orbitrap at 70,000 FWHM and 
17,500 FWHM resolution, respectively. The 15 most abundant ions were selected 
for MS/MS. The data were processed with the Mascot Server (Matrix Science). 
Mascot DAT files were parsed into the Scaffold software for validation, filtering 
and to create a non-redundant list per sample. Data were filtered at 1% protein and 
peptide FDR, requiring at least two unique peptides per protein. Mass spectro- 
metry analysis of precipitated proteins was performed once. 
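
The two acceptance criteria stated above (1% protein and peptide FDR, at least two unique peptides per protein) can be illustrated with a short sketch. In the study this filtering was done in Scaffold; the code below is not that software's algorithm. The record fields, the function name and the example entries are hypothetical placeholders, and the peptide-level FDR is assumed to have been applied already when the unique peptides were counted.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ProteinHit:
    name: str
    protein_fdr: float    # estimated protein-level FDR, as a fraction
    unique_peptides: int  # unique peptides that already passed the peptide-level FDR


def filter_hits(hits: List[ProteinHit],
                max_fdr: float = 0.01,
                min_unique_peptides: int = 2) -> List[ProteinHit]:
    """Keep proteins meeting both the FDR and the unique-peptide criteria."""
    return [
        h for h in hits
        if h.protein_fdr <= max_fdr and h.unique_peptides >= min_unique_peptides
    ]


if __name__ == "__main__":
    # Placeholder records for illustration only; not data from the study.
    example = [
        ProteinHit("protein_A", 0.004, 5),
        ProteinHit("protein_B", 0.004, 1),   # fails the unique-peptide criterion
        ProteinHit("protein_C", 0.030, 7),   # fails the FDR criterion
    ]
    print([h.name for h in filter_hits(example)])
```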

Native gel and tumour cell GPNMB staining. Recombinant mouse GPNMB 
(R&D) was mixed with native loading buffer (16% glycerol, 1% Trypan blue and 
50 mM Tris-HCl pH 7.0) and 62.5 and 125 ng per well were run for 2 h in a Novex
NativePAGE Bis-Tris gel system (Life Technologies) on ice. Bands were transferred
to a nitrocellulose membrane and incubated overnight with 10 µg ml⁻¹
mouse IgG, or with 1 µg ml⁻¹ rabbit polyclonal IgG anti-mouse GPNMB (cat. no.
S-24 sc-133634, Santa Cruz). The membranes were washed, incubated for 45 min 
with goat anti-mouse IgG light-chain-specific antibodies conjugated to HRP 
(Pierce), developed with SuperSignal West Femto Substrate (Pierce), and exposed 
together for imaging. 

For FACS staining of GPNMB on tumour cells, 1 × 10° B16 or LMP cells were
incubated with 2 µg rabbit polyclonal anti-mouse GPNMB (Santa Cruz) for
30 min, washed twice and incubated for 20 min with PE-conjugated donkey anti- 
rabbit or goat anti-mouse antibodies, respectively (both from eBioscience). FACS 
measurements were repeated three independent times in biological duplicates. 

In vivo tumour models. For tumour challenge studies, 2 × 10° and 5 × 10⁴ LMP
or B16 tumour cells, respectively, were injected subcutaneously (s.c.) above the
right flank, and tumour development was measured twice a week with calipers. In
some experiments, 1-2 × 10° tumour cells were labelled with 25 µM CFSE
according to the manufacturer’s instructions (Invitrogen). Tumour challenge 
experiments were repeated independently at least 8 times with 4 mice per group. 
For prophylactic immunization, mice were injected twice s.c., 7 days apart, with 
2 X 10° DCs or monocytes that were loaded with tumour lysates or immune 
complexes. This was independently repeated 3 times with 4 mice per group. For 
tumour recurrence studies, 2 X 10° tumour cells were injected s.c. above the right 
flank, and the size of growing tumours was measured using calipers. When 
tumours reached 45-55 mm² for LMP and 12-16 mm² for B16, mice were anaesthetized
and visible macroscopic tumour was surgically removed. Resected
tumours were enzymatically digested with 0.1 mg ml⁻¹ of DNase I (Sigma) and
5 mg ml⁻¹ collagenase IV (Sigma) in HBSS. Cells were then fixed in 2%
paraformaldehyde for 20 min, washed extensively in PBS and coated for 30 min
with syngeneic or allogeneic antibodies. In some experiments, tumour cells were
coated with mouse anti-mouse anti-H2-K® (2)1g/1x10" cells) or its isotype control 
(C1.18.4, both from BioXcell). Antibody-coated tumour cells were then washed 
and added to DC cultures. After overnight incubation, DCs were washed and 
2.5 X 10° were injected s.c. to tumour-resected mice one day after the tumours 
were removed, adjacent to the site of tumour resection. This experiment was 
repeated independently at least 3 times with 4 mice per group. For in vivo tumour 
treatments, a combination of 2 µg TNFα (PeproTech) and 100 µg agonistic anti-
CD40 (FGK4.5, BioXcell), 5 µg recombinant CD40L (PeproTech), 5 µg CD28
(R&D Systems, Minneapolis, Minnesota), 5 µg LPS or 200 µg poly(I:C)
(InvivoGen), and 400 µg mouse allogeneic or syngeneic IgG or anti-GPNMB
(Santa Cruz), was injected twice (2 days apart) directly into tumours. Experiments
were repeated independently at least 5 times with 4-5 mice per group.
For treatment of the Braf^V600E melanoma model, mice were injected twice (2 days
apart) in 2 cycles, one week apart, with 1 mg IgG derived from CD-1 mice along
with TNFα and anti-CD40 once the largest tumour nodule reached 16 mm². For
metastasis experiments, 1 × 10° 4T-1.1 cells were injected into the mammary fat pad
of syngeneic Balb/c mice. After 14-16 days, once tumours metastasized into the 
draining lymph node, the primary tumour nodules were injected twice (2 days 
apart) in 2 cycles, one week apart, with 1 mg IgG derived from CD-1 mice along 
with TNFα and anti-CD40. Experiments were repeated independently at least
3 times with 3-5 mice per group. 

In vivo binding of PE-labelled allogeneic IgG. Allogeneic antibodies were fluorescently
labelled with PE using Lightning-Link kits according to the manufacturer’s
instructions (Innova Biosciences Ltd, Cambridge, UK). Subsequently, 5 µg
of labelled allogeneic IgG was injected intratumorally alone or with TNFα and
anti-CD40. After 2 h, tumours were enzymatically digested to obtain a single cell 
suspension and the PE levels were analysed by flow cytometry along with lineage 
markers. 

Covalent binding of syngeneic antibodies to tumour cells. Syngeneic IgG was 
cross-linked to primary amines of B16 cell surface proteins using sulfo-LC-SPDP 
(sulfosuccinimidyl 6-(3'-[2-pyridyldithio]-propionamido) hexanoate, Pierce) 
according to the manufacturer’s instructions. Briefly, both the antibodies and cells 
were initially treated with sulfo-LC-SPDP to label primary amines. Next, disulfide 
bonds in syngeneic IgG were reduced by treatment with DTT. Finally, the reduced 
syngeneic IgG was incubated with SPDP-labelled B16 cells and the level of binding 
was later assessed by flow cytometry. Experiments were repeated independently 
3 times with 4 mice per group. 

In vivo cell depletion. Depletion of CD4⁺ and CD8⁺ T cells was achieved by
intraperitoneal (i.p.) injection of 500 µg per mouse GK1.5 (anti-CD4) and YTS
169.4 (anti-CD8) monoclonal antibodies (both from BioXcell, West Lebanon),
respectively, 3 days before tumour inoculation and every 3 days thereafter. T-cell
depletion experiments were repeated independently 3 times for each depletion
antibody with 3-4 mice per group. In some experiments, B16-bearing mice were
injected with 500 µg per mouse anti-CD8 or anti-CD4 2 days before their treatment
with antibodies + TNFα + anti-CD40 and once a week thereafter. These
T-cell depletion experiments were repeated independently 2 times for each depletion
antibody with 5 mice per group. For B cell depletion, 300 µg per mouse anti-
CD19 (1D3) and 300 µg per mouse anti-B220 (RA3.3A1/6.1) (both from BioXcell)
were injected i.p. three weeks before tumour inoculation and every 5 days thereafter.
B-cell depletion experiments were repeated independently 3 times with
3-5 mice per group. For NK cell depletion, mice were injected i.p. with 50 µl
anti-asialo-GM1 polyclonal antibody (Wako Chemicals, Richmond, Virginia),
or with 200 µg anti-NK1.1 (PK136) (BioXCell) on days −2, 0, 4 and 8 relative to
tumour challenge. Individual mice were bled on days 0, 7, 14 and 21 and the levels
of NK1.1⁺/CD3ε⁻ cells were determined by flow cytometry to confirm depletion.
NK cell depletion experiments were repeated independently 3 times with anti-
asialo depletion antibody with 3-5 mice per group.
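
The depletion regimens above are described relative to tumour inoculation (day 0). The sketch below lays them out as explicit lists of injection days. The helper name is hypothetical, and the end day of 21 is an assumption chosen to match the last bleed, since the text does not say when dosing stopped.

```python
from typing import List


def injection_days(first_day: int, interval_days: int, last_day: int) -> List[int]:
    """Injection days relative to tumour inoculation (day 0), inclusive of last_day."""
    if interval_days <= 0:
        raise ValueError("interval_days must be positive")
    return list(range(first_day, last_day + 1, interval_days))


# Assumed end of dosing; not stated in the Methods.
ASSUMED_LAST_DAY = 21

# T-cell depletion: 3 days before inoculation, then every 3 days.
T_CELL_DAYS = injection_days(-3, 3, ASSUMED_LAST_DAY)

# B-cell depletion: three weeks before inoculation, then every 5 days.
B_CELL_DAYS = injection_days(-21, 5, ASSUMED_LAST_DAY)

# NK-cell depletion: the days are listed explicitly in the text.
NK_CELL_DAYS = [-2, 0, 4, 8]

if __name__ == "__main__":
    print("T cell:", T_CELL_DAYS)
    print("B cell:", B_CELL_DAYS)
    print("NK cell:", NK_CELL_DAYS)
```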

Adoptive transfer. Mice were injected i.v. with 1 mg per mouse of syngeneic or 
allogeneic IgG or IgM one day before tumour challenge and once again with 
tumour injection. For T-cell transfer, splenic CD4⁺ and CD8⁺ T cells were negatively
selected using a murine enrichment kit (Stem Cell Technologies) and at least
5 × 10° cells were injected i.v. to recipient mice one day before tumour challenge.
T-cell adoptive transfer experiments were repeated independently 3 times for each
T cell subset with 3-5 mice per group. Prior to their transfer, tumour-associated
cell subsets were enriched as follows: TADCs were isolated by enrichment of
MHCII⁺ cells on magnetic beads (Miltenyi) and subsequent sorting of
Ly6C” CD11¢* CD644" by FACS. Tumour macrophages were enriched with 
CD11b* magnetic beads (Miltenyi) followed by sorting of Ly6C” CD64” cells. 
B cells were enriched with CD19⁺ magnetic beads (Miltenyi). NK cells were
enriched with NK1.1⁺ magnetic beads (Miltenyi), and mast cells were enriched
with c-kit⁺ magnetic beads (Miltenyi). For each cell subset, 2 × 10° cells were
injected s.c. into naive mice 3 days before being challenged with 5 × 10⁴ B16
tumour cells. Transfer experiments for each cell type were repeated 3 times
independently with 3-5 mice per group.

T-cell proliferation. 3 × 10⁴ DCs were co-cultured with 3 × 10° MACS-enriched
CD4⁺ T cells (Miltenyi, Germany) from spleens of LMP- or B16-immunized mice.
After 6 days, cells were pulsed with [³H]thymidine (1 µCi per well) and cultured for
an additional 18 h before being harvested in a Harvester 400 (Tomtec). Radioactivity 
was measured by a 1450 MicroBeta counter (LKB Wallac). T-cell proliferation was 
repeated 5 times with 3 biological replicates and 6 technical replicates for each. 

In vivo BrdU incorporation. Tumour-challenged mice were injected i.p. every 
day with 1 mg of 5-bromo-2′-deoxyuridine (BrdU) in 200 µl PBS. At several time
points, mice were killed and single cell suspensions were prepared from BM, lymph 
nodes and tumour tissues. Cells were then stained for lineage markers followed by 
intracellular staining with FITC-conjugated anti-BrdU antibody according to man- 
ufacturer’s instructions (BD Pharmingen) and analysed by flow cytometry. 
Experiments were repeated independently 3 times with 3-5 mice per group. 

Immunofluorescence. DCs or monocytes were incubated on glass-bottom culture
plates (In Vitro Scientific) with CFSE-labelled tumour cells with or without
antibodies overnight. Cells were gently washed with PBS (Gibco), fixed for 20 min 
with 2% paraformaldehyde and permeabilized with 0.5% saponin (Sigma). 
Samples were blocked with 10% non-immune goat serum and stained with 
Alexa-conjugated anti-mouse IgG and IgM (Invitrogen 1:100) and anti-mouse 
I-Ab (BD Biosciences, 1:100). DC immunostainings were independently repeated 
at least 3 times in biological duplicates and 3 fields were documented in each slide. 

Immunohistochemistry. Specimens were fixed in 4% paraformaldehyde, equilibrated
in a 20% sucrose solution and embedded in frozen tissue matrix (Tissue-
Tek OCT, Torrance, California). Slides were cut to 5 µm, blocked with 10% non-
immune goat serum and stained with goat anti-mouse IgG (Invitrogen 1:100) and 
anti-mouse IgM (II/41 eBioscience, 1:100). Sections were examined under a Zeiss 
Laser Scanning Confocal Microscope. Images were collected using a Zeiss 700 
confocal laser scanning microscope, and analysed using ZEN software (Carl 
Zeiss Microscopy). Tumour immunostainings were repeated independently at 
least 3 times in biological duplicates and 3 fields were captured for each slide. 

Statistics. No statistical methods were used to predetermine sample size, but
sample size was chosen such that statistical significance could be achieved using 
appropriate statistical tests (for example, ANOVA) with errors approximated 
from previously reported studies. A non-parametric Mann-Whitney U test was 
performed in Prism (GraphPad Software, Inc.) to analyse experimental data, 
unless otherwise stated. Phospho-specific flow cytometry data were transformed 
by taking the inverse hyperbolic sine (arcsinh), and ratios were taken over the 
corresponding baseline (unstimulated) value³¹. No blinded experiments were performed.
No samples were excluded from analyses. P values indicate significance
of the difference between experimental and control (CT) values. *P < 0.05;
**P < 0.01. Error bars represent ± s.e.m.
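
The quantities named in this section can be sketched in a few lines of standard-library Python. The study itself used Prism, so this is an illustration rather than the authors' analysis code. Two points are assumptions: the ratio is read here as arcsinh(stimulated) divided by arcsinh(baseline), and the `cofactor` argument (default 1, meaning no scaling) is not mentioned in the text. The function returns only the Mann-Whitney U statistic and does not compute a P value.

```python
import math
from statistics import stdev
from typing import Sequence


def arcsinh_ratio(stimulated: float, baseline: float, cofactor: float = 1.0) -> float:
    """Ratio of arcsinh-transformed signal over the arcsinh-transformed baseline."""
    base = math.asinh(baseline / cofactor)
    if base == 0.0:
        raise ValueError("baseline transforms to zero; ratio is undefined")
    return math.asinh(stimulated / cofactor) / base


def mann_whitney_u(x: Sequence[float], y: Sequence[float]) -> float:
    """Mann-Whitney U statistic (smaller of the two U values); ties count 0.5."""
    u_x = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in x for b in y)
    u_y = len(x) * len(y) - u_x
    return min(u_x, u_y)


def sem(values: Sequence[float]) -> float:
    """Standard error of the mean, using the sample standard deviation."""
    return stdev(values) / math.sqrt(len(values))


if __name__ == "__main__":
    # Placeholder numbers for illustration only; not data from the study.
    control = [1.0, 2.0, 3.0]
    treated = [4.0, 5.0, 6.0]
    print("U =", mann_whitney_u(control, treated))
    print("s.e.m. of control =", sem(control))
    print("arcsinh ratio =", arcsinh_ratio(200.0, 50.0))
```

A U of 0 means the two groups do not overlap at all. Converting U to a P value additionally requires the exact or normal-approximation null distribution, which a statistics package supplies.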


31. Irish, J. et al. B-cell signaling networks reveal a negative prognostic human
lymphoma cell subset that emerges during tumor progression. Proc. Natl Acad.
Sci. USA 107, 12747-12754 (2010).
Extended Data Figure 1 | DCs acquire an activated phenotype in response to
allogeneic tumours injected in vivo, but not when co-cultured in vitro.
a, LMP (left) and B16 (right) growth in 129S1, C57BL/6, or allogeneic hosts pretreated
with anti-asialo-GM1 or anti-NK1.1 antibodies (n = 6, 3 independent
experiments). Shown are representative plots of NK cells in the blood before
tumour challenge. b, BrdU incorporation by CD4⁺ T cells (top) and CD8⁺ T cells
(bottom) in lymphoid organs of 129S1 and C57BL/6 LMP-bearing mice (n = 8, 3
independent experiments). c, Representative flow cytometric analysis of
CD11b⁺Ly6C⁺ myeloid cells and mature DCs (mDCs) on day 10 after C57BL/6
mice were inoculated with B16 tumour cells. d, Flow cytometric analysis of
Ly6C⁻CD11c⁺MHCII⁺ cells from LMP-bearing mice (left) and B16-bearing
mice (right). Histograms show representative expression levels of co-stimulatory
molecules on DCs from C57BL/6 and 129S1 mice (n = 8, 3 independent
experiments). e, IL-12 (right) and TNFα (left) in the supernatants of syngeneic
BMDCs, syngeneic blood monocyte-derived (Mo) DCs, allogeneic BMDCs or
Mo-DCs incubated with live, frozen-thawed (necrotic), or mitomycin-C-treated
(apoptotic) LMP cells or E. coli BioParticles overnight (n = 8, 4 independent
experiments). Shown are the mean values ± s.e.m. *P < 0.05; **P < 0.01.
Extended Data Figure 2 | Allogeneic hosts have a much higher titre of
tumour-binding antibodies compared to syngeneic hosts. a, Flow cytometric
analysis of the binding of various concentrations of IgG from 129S1, IgM from
129S1, IgG from C57BL/6 and IgM from C57BL/6 mice to LMP and B16 cells. The
lower panel shows a representative histogram of IgG (left) or IgM (right) binding
after incubation of 1 µg of C57BL/6 or 129S1 antibodies with 1 × 10° LMP (upper)
or B16 (lower) cells (n = 8, 4 independent experiments). b, The left panel shows a
representative histogram of the MFI of IgG after incubation of 2 µg of either
control antibody (secondary Ab) or IgG from the serum of naive C57BL/6 mice,
B16-bearing C57BL/6 mice on day 7, B16-bearing C57BL/6 mice on day 14 or
naive 129S1 mice with 1 × 10° B16 cells (n = 6, 4 independent experiments).
Right graph shows MFI of the binding of 2 µg of each IgG to 1 × 10° B16 cells.
c, Serum levels of IgG (left) and IgM (right) in C57BL/6 and 129S1 mice following
i.p. injection with anti-B220 and anti-CD19 antibodies (n = 8, 3 independent
experiments). d, LMP tumour size in naive 129S1 mice injected with allogeneic
IgG, allogeneic IgM, syngeneic IgG or syngeneic IgM on days −1 and 0 relative to
tumour injection (n = 6, 3 independent experiments). Shown are the mean
values ± s.e.m. *P < 0.05; **P < 0.01.
Extended Data Figure 3 | Activation of BMDCs with immune complexes
induces transferable T-cell immunity. a, Mean levels of CD40 and CD86
expression (left) and IL-12 secretion (right) in BMDCs from C57BL/6 (WT) and
FcγR KO mice activated with IgG-IC overnight (n = 6, 10 independent
experiments). b, Proliferation of CD4⁺ T cells cultured with BMDCs from
C57BL/6 and FcγR KO mice loaded with IgG-IC (n = 4, 5 independent
experiments). c, Tumour recurrence in untreated mice, mice treated with WT
BMDCs loaded with IgG-IC, or mice treated with FcγR KO BMDCs loaded with
IgG-IC (n = 8, 3 independent experiments). d, e, Percentages of tumour-free mice
after adoptive transfer of 5 × 10° splenic CD4⁺ T cells (left graph) or CD8⁺ T cells
(right graph) from naive mice, or from LMP (d)- or B16 (e)-resected mice treated
with DCs + IgG_C57 IC, DCs + IgM_C57 IC, DCs + IgG_129 IC, or DCs + IgM_129 IC,
and subsequently challenged with LMP (d) or B16 (e) (n = 6, 3 independent
experiments). Shown are the mean values ± s.e.m. **P < 0.01.
Extended Data Figure 4 | Tumour-associated DCs do not respond to
immune complexes. a, Sorting and culture schema of DCs from BM and
tumour. b, Mean levels of IL-12 (left) and TNFα (right) in the supernatants of
DCs cultured overnight in medium alone, with B16 lysates, or with alloIgG-IC
(n = 6, 4 independent experiments). c, Percentage of MHCII⁺CD86⁺ cells
(left) or CFSE levels (right) in tumour-associated DCs after overnight activation
with PBS or CFSE-labelled alloIgG-IC with or without stimulatory molecules
(n = 12, 10 independent experiments). d, Representative flow cytometric analysis
and confocal images from one out of three independent experiments of B16-
derived DCs cultured overnight with CFSE-labelled fixed B16 cells (n = 8, 10
independent experiments). Shown are the mean values ± s.e.m. *P < 0.05;
**P < 0.01.
Extended Data Figure 5 | Tumour DCs from mice treated with alloIgG +
adjuvant can internalize immune complexes and transfer immunity. a, B16
tumour size in C57BL/6 mice left untreated or injected intratumorally with 129S1
allogeneic IgG, LPS, TNFα + CD28, LPS + allogeneic IgG or TNFα + CD28 +
allogeneic IgG (n = 15, 3 independent experiments). b, B16 tumour size in
C57BL/6 mice left untreated or injected intratumorally with 129S1 allogeneic IgG,
TNFα, CD28, or CD40L (n = 12, 3 independent experiments). c, Lewis lung
carcinoma (LL/2) tumour size in C57BL/6 mice left untreated, or injected
intratumorally with 129S1 allogeneic IgG, TNFα + CD40L, TNFα + CD28,
TNFα + CD40L + 129S1 allogeneic IgG or TNFα + CD28 + 129S1 IgG (n = 8,
2 independent experiments). d, Representative flow cytometric analysis from one
out of three independent experiments of IgG binding total myeloid cells in B16
tumour-bearing mice 3 h after intratumoral injection of PBS or 5 µg PE-labelled
allogeneic IgG. e, Total numbers of CD11c⁺ cells in the draining lymph nodes of
B16 tumour-bearing mice 4 days after treatment (n = 6, 3 independent
experiments). f, Gating and sorting strategy of immune cell populations
infiltrating B16 tumours. g, B16 growth in mice vaccinated with 2 × 10° B cells,
mast cells, macrophages or NK cells from B16 tumours untreated, or injected with
allogeneic IgG or allogeneic IgG + TNFα + anti-CD40 (n = 6, 3 independent
experiments). Shown are the mean values ± s.e.m. *P < 0.05; **P < 0.01.
Extended Data Figure 6 | Allogeneic IgG recognises non-mutated cell 
membrane proteins on tumour cells. a, B16 frequency in mice untreated, or 
treated with BMDCs loaded with intact B16 cells coated with allogeneic IgG, or 
with intact B16 cells cross-linked to syngeneic IgG (n = 8, 3 independent 
experiments). b, B16 tumour frequency in mice untreated or treated with BMDCs 
loaded with intact B16 cells coated with allogeneic IgG or with intact B16 coated 
with monoclonal IgG against MHC-I (n = 8, 3 independent experiments). 
c, RMA tumour growth following inoculation with 2.5 × 10° tumour cells in naive 
C57BL/6 mice, or in C57BL/6 mice in which B16 tumours had completely 
regressed after treatment with allogeneic IgG + TNFα + anti-CD40. Also shown 
is the lack of B16 tumour growth in C57BL/6 mice that were re-challenged with 
2 × 10° B16 tumour cells following the regression of this tumour after treatment 
with allogeneic IgG + TNFα + anti-CD40 (n = 8, 2 independent experiments). 
d, Left: tumour frequency in mice untreated or treated with DCs loaded with 
immune complexes formed with allogeneic IgG and cytosolic tumour proteins, 
nuclear tumour proteins or membrane tumour proteins. Right: tumour frequency 
in mice untreated, treated with DCs loaded with immune complexes formed from 
allogeneic IgG and membrane proteins, membrane proteins without O- and 
N-glycans, or heat-denatured membrane proteins (n = 5, 3 independent 
experiments). e, B16 tumour growth in C57BL/6 mice untreated, or injected with 
TNFα + anti-CD40, TNFα + anti-CD40 + allogeneic IgG, or TNFα + anti-CD40 
and allogeneic IgG absorbed on normal cells of the IgG-donor background 
(blue diamonds) or on normal cells of the tumour background (green squares) 
(n = 6, 3 independent experiments). f, Tumour recurrence rates after resection in 
mice left untreated, treated with 2 × 10° DCs loaded with IgG-IC from 
conventionally raised C57BL/6, or with 2 × 10° DCs loaded with IgG-IC from 
gnotobiotic C57BL/6 mice (n = 6, 2 independent experiments). Shown are the 
mean values ± s.e.m. *P < 0.05; **P < 0.01. 




[Extended Data Figure 7, panels a-c (graphics not reproduced). a, Binding to 
spleen, skin and B16 cells of αGP-NMB at 2 μg IgG and 1 μg IgG, and of 
secondary antibody alone. b, BMDCs (MHCII axis) cultured with B16 or LMP 
cells, PBS versus αGP-NMB. c, Western blot lanes M, 62 ng and 125 ng probed 
with 129-IgG, C57-IgG or αGP-NMB.] 


Extended Data Figure 7 | Allogeneic hosts have a higher titre of anti-GPNMB 
IgG, which can be used to induce tumour immunity. a, Representative 
flow cytometric analysis and quantification of binding of anti-IgG 
secondary antibody alone, 1 μg anti-GPNMB or 2 μg GPNMB per 1 × 10° 
B16 cells, normal skin cells, or normal spleen cells (n = 6, 3 independent 
experiments). b, Percentage of MHCII+ CD86+ BMDCs following 
overnight activation with untreated LMP or B16 tumour cells, or with 
tumour cells coated with anti-GPNMB (2 μg per 1 × 10° tumour cells) 
(n = 8, 3 independent experiments). c, Western blot of recombinant 
GPNMB (62.5 ng and 125 ng) performed with 10 μg ml−1 of IgG from naive 
129S1 mice, naive C57BL/6 mice, or 1 μg ml−1 anti-GPNMB (2 
independent experiments). d, B16 tumour size in mice untreated or treated 
with TNFα + anti-CD40, allogeneic IgG, anti-GPNMB IgG, TNFα + anti-CD40 
+ allogeneic IgG, or with TNFα + anti-CD40 + anti-GPNMB 
(n = 8, 3 independent experiments). e, B16 tumour size in C57BL/6 WT 
mice untreated or treated with TNFα + anti-CD40, TNFα + anti-CD40 + 
allogeneic IgG, or with TNFα + anti-CD40 + anti-GPNMB, or in FcγR KO 
mice treated with TNFα + anti-CD40 + allogeneic IgG, or with TNFα + 
anti-CD40 + anti-GPNMB (n = 8, 3 independent experiments). Shown are 
the mean values ± s.e.m. *P < 0.05; **P < 0.01. 
Extended Data Figure 8 | AlloIgG and anti-GPNMB IgG induce tumour-reactive 
T-cell infiltration and activation. a, Representative flow cytometry 
plots of CD4+ and CD8+ cells in B16 tumours 6 days after treatment. Left: 
percentage of CD45+ cells infiltrating B16 tumours 15-17 days after s.c. 
inoculation or 6 days after treatment. Right: percentage of CD4+ and CD8+ 
cells among tumour-infiltrating CD45+ cells (n = 10, 3 independent 
experiments). b, Percentages of CD44 and IFNγ co-expressing CD4+ and 
CD8+ cells among tumour-infiltrating CD45+ cells 6 days after treatment 
or 15 days following s.c. inoculation (n = 10, 3 independent experiments). 
c, Frequency of IFNγ-expressing T cells that recognize gp100 and Trp2 
among day 6 post-treatment tumour-infiltrating CD8+ cells. Gate shown: 
CD8+ T cells (n = 10, 3 independent experiments). d, Percentage of 
tumour-free mice following adoptive transfer of T cells from day 6 
post-treatment B16 tumour-bearing mice untreated, treated with TNFα + 
anti-CD40, with TNFα + anti-CD40 + allogeneic IgG, or with TNFα + 
anti-CD40 + anti-GPNMB (n = 9, 3 independent experiments). e, Upper 
left: B16 tumour growth in untreated C57BL/6 mice injected with rat IgG, 
with rat anti-CD4, or with rat-CD8. Upper right: B16 tumour growth in 
C57BL/6 mice treated with TNFα + anti-CD40 and injected with rat IgG, 
with rat anti-CD4, or with rat-CD8. Lower left: B16 growth in C57BL/6 
mice treated with TNFα + anti-CD40 + allogeneic IgG and injected with 
rat IgG, with rat anti-CD4, or with rat-CD8. Lower right: B16 growth in 
C57BL/6 mice treated with TNFα + anti-CD40 + anti-GPNMB and 
injected with rat IgG, with rat anti-CD4, or with rat-CD8 (n = 9, 3 
independent experiments). Shown are the mean values ± s.e.m. from three 
independent experiments. *P < 0.05; **P < 0.01. 
Extended Data Figure 9 | AlloIgG can activate human tumour DCs to 
process autologous tumour cells. a, Representative haematoxylin and eosin 
sections of lung metastases on day 30 from one out of three independent 
experiments performed (original magnification, 10×). b, MFI of tumour cells 
from MSTO-resected patients coated with autologous IgG or IgG from healthy 
donors (n = 2 in technical triplicates). c, d, Wide-field microscopy (c) and flow 
cytometry plots (d) of TADCs from a lung carcinoma patient incubated 
overnight with autologous CFSE-labelled tumour cells (green) coated with self 
IgG or allogeneic IgG derived from a pool of 10 donors (1 μg per 2 × 10° cells) 
and in the presence of 50 ng ml−1 TNFα and 1 μg ml−1 CD40L (n = 2 in 
technical triplicates). Shown are the mean values ± s.e.m. from two 
independent experiments. *P < 0.05; **P < 0.01. 
Extended Data Table 1 | Allogeneic IgG binds numerous membrane tumour proteins 

No. | Identified Proteins | Accession Number | C57 SpC | 129 SpC | C57 SAF | 129 SAF | C57 NSAF | 129 NSAF | C57/129 ratio 

Endoplasmic reticulum membrane 
1 | Transmembrane protein 93 | sp|Q9CQW0|TMM93_MOUSE | 0 | 2 | 0 | 0.166667 | 0 | 0.0007367 | 2 
2 | Endoplasmic reticulum-Golgi intermediate compartment protein 3 | sp|Q9CQE7|ERGI3_MOUSE | 0 | 2 | 0 | 0.046512 | 0 | 0.00020559 | 2 
3 | Reticulon-4 | sp|Q99P72|RTN4_MOUSE | 0 | 2 | 0 | 0.015748 | 0 | 6.9609E-05 | 2 
4 | Uncharacterized protein C12orf41 homolog | sp|Q8BQR4|CL041_MOUSE | 0 | 2 | 0 | 0.037037 | 0 | 0.00016371 | 2 
5 | Erlin-2 | sp|Q8BFZ9|ERLN2_MOUSE | 0 | 2 | 0 | 0.052632 | 0 | 0.00023264 | 2 
6 | Transitional endoplasmic reticulum ATPase | sp|Q01853|TERA_MOUSE | 0 | 2 | 0 | 0.022472 | 0 | 9.933E-05 | 2 
7 | Dolichyl-diphosphooligosaccharide--protein glycosyltransferase subunit DAD1 | sp|P61804|DAD1_MOUSE | 0 | 2 | 0 | 0.166667 | 0 | 0.0007367 | 2 
8 | Calnexin | sp|P35564|CALX_MOUSE | 0 | 2 | 0 | 0.029851 | 0 | 0.00013195 | 2 
9 | Calumenin | sp|O35887|CALU_MOUSE | 0 | 2 | 0 | 0.054054 | 0 | 0.00023893 | 2 
10 | Vesicle-associated membrane protein-associated protein A | sp|Q9WV55|VAPA_MOUSE | 0 | 3 | 0 | 0.107143 | 0 | 0.00047359 | 3 
11 | Mannosyl-oligosaccharide glucosidase | sp|Q80UM7|MOGS_MOUSE | 0 | 3 | 0 | 0.032609 | 0 | 0.00014414 | 3 
12 | Neutral alpha-glucosidase | sp|Q8BHN3|GANAB_MOUSE | 0 | 3 | 0 | 0.028037 | 0 | 0.00012393 | 3 
13 | ERO1-like protein alpha | sp|Q8R180|ERO1A_MOUSE | 0 | 5 | 0 | 0.092593 | 0 | 0.00040928 | 5 
14 | UDP-glucose:glycoprotein glucosyltransferase 1 | sp|Q6P5E4|UGGG1_MOUSE | 0 | 5 | 0 | 0.028409 | 0 | 0.00012557 | 5 
15 | Prolyl 4-hydroxylase subunit alpha-1 | sp|Q60715|P4HA1_MOUSE | 0 | 5 | 0 | 0.081967 | 0 | 0.00036231 | 5 
16 | Epoxide hydrolase 1 | sp|Q9D379|HYEP_MOUSE | 0 | 9 | 0 | 0.169811 | 0 | 0.0007506 | 9 
17 | Calreticulin | sp|P14211|CALR_MOUSE | 0 | 14 | 0 | 0.291667 | 0 | 0.00128922 | 14 
18 | Sarcoplasmic/endoplasmic reticulum calcium ATPase | sp|O55143|AT2A2_MOUSE | 8 | 18 | 0.06956522 | 0.156522 | 0.0003684 | 0.00069185 | 1.8782025 
19 | Protein disulfide-isomerase A4 | sp|P08003|PDIA4_MOUSE | 0 | 12 | 0 | 0.166667 | 0 | 0.0007367 | 12 
20 | Protein disulfide-isomerase | sp|P09103|PDIA1_MOUSE | 0 | 12 | 0 | 0.210526 | 0 | 0.00093056 | 12 
21 | Protein disulfide-isomerase A3 | sp|P27773|PDIA3_MOUSE | 0 | 9 | 0 | 0.157895 | 0 | 0.00069792 | 9 
22 | Protein disulfide-isomerase A6 | sp|Q922R8|PDIA6_MOUSE | 0 | 11 | 0 | 0.229167 | 0 | 0.00101296 | 11 

Melanosomes and vesicles membranes 
1 | Peptidyl-prolyl cis-trans isomerase B | sp|P24369|PPIB_MOUSE | 0 | 7 | 0 | 0.291667 | 0 | 0.00128922 | 7 

Cell membrane 
1 | T-complex protein 1 subunit gamma | sp|P80318|TCPG_MOUSE | 0 | 2 | 0 | 0.032787 | 0 | 0.00014492 | 2 
2 | Monocarboxylate transporter 4 | sp|P57787|MOT4_MOUSE | 0 | 2 | 0 | 0.04 | 0 | 0.00017681 | 2 
3 | Nicastrin | sp|P57716|NICA_MOUSE | 0 | 2 | 0 | 0.025641 | 0 | 0.00011334 | 2 
4 | Basigin | sp|P18572|BASI_MOUSE | 0 | 2 | 0 | 0.047619 | 0 | 0.00021048 | 2 
5 | Vesicle-associated membrane protein-associated protein A | sp|Q9WV55|VAPA_MOUSE | 0 | 3 | 0 | 0.107143 | 0 | 0.00047359 | 3 
6 | Retrovirus-related Env polyprotein from Fv-4 | sp|P11370|ENV2_MOUSE | 0 | 3 | 0 | 0.040541 | 0 | 0.0001792 | 3 
7 | Synaptic vesicle membrane protein | sp|Q62465|VAT1_MOUSE | 0 | 4 | 0 | 0.093023 | 0 | 0.00041118 | 4 
8 | 4F2 cell-surface antigen heavy chain | sp|P10852|4F2_MOUSE | 0 | 4 | 0 | 0.068966 | 0 | 0.00030484 | 4 
9 | Alpha-enolase | sp|P17182|ENOA_MOUSE | 0 | 5 | 0 | 0.106383 | 0 | 0.00047023 | 5 
10 | Integrin-linked protein kinase | sp|O55222|ILK_MOUSE | 0 | 4 | 0 | 0.078431 | 0 | 0.00034668 | 4 
11 | Transmembrane glycoprotein NMB | sp|Q99P91|GPNMB_MOUSE | 2 | 15 | 0.03125 | 0.234375 | 0.0001655 | 0.00103598 | 6.2606749 
12 | MLV-related proviral Env polyprotein | sp|P10404|ENV1_MOUSE | 0 | 13 | 0 | 0.185714 | 0 | 0.00082089 | 13 
13 | ERO1-like protein alpha | sp|Q8R180|ERO1A_MOUSE | 0 | 5 | 0 | 0.092593 | 0 | 0.00040928 | 5 
14 | Clathrin heavy chain 1 | sp|Q68FD5|CLH_MOUSE | 0 | 5 | 0 | 0.026042 | 0 | 0.00011511 | 5 
15 | Desmoglein-1-alpha | sp|Q61495|DSG1A_MOUSE | 2 | 5 | 0.0173913 | 0.043478 | 9.209E-05 | 0.00019218 | 2.0868916 
16 | Sodium/potassium-transporting ATPase subunit alpha-1 | sp|Q8VDN2|AT1A1_MOUSE | 4 | 12 | 0.03539823 | 0.106195 | 0.0001874 | 0.0004694 | 2.50427 
b 
Identified Proteins 
Endoplasmic reticulum membrane 
1__|DnaJ homolog subfamily B member 11 Q99KV1|DJB11_MOUSE 6] 0.19512195] 0. 0.0010332] 0.00064686] 0.6260675| 
2 78 kDa glucose-regulated protein P20029|GRP78_MOUSE 1.01388889} 0. 0.0053687] 0.00435879] 0.8118866) 
3 Serpin H1 P19324| SERPH_MOUSE 17| 0.23404255] 0. 0.0012393] 0.00159879} 1.2900785) 
4 _|Protein transport protein Sec61 subunit beta Q9CQS8|SC61B_MOUSE 3 0.2 E 0.001059} 0.00132605] 1.252135 
is} Leucine-rich repeat-containing protein 59 Q922Q8|LRC59_MOUSE 0.08571429] 0. 0.0004539] 0.00025258} 0.5565044, 
6 Protein transport protein Sec61 subunit alpha isoform 1 P61620|S61A1_MOUSE 0.17307692] 0. 0.0009165} 0.00085003} 0.9275074| 
u Dolichy!-diphosphooligosaccharide--protein glycosyltransferase 48 kDa subunit 054734|OST48_MOUSE 0.06122449] 0. 0.0003242} 0.00045104] 1.3912611 
8 Estradiol 17-beta-dehydrogenase 12 070503|DHB12_MOUSE 0.2] 0. 0.001059} 0.00075774| 0.7155057 
Melanosomes and vesicles membranes 
4 Flotillin-2 Q60634|FLOT2_MOUSE 0.06382979] 0.' 0.000338] 0.00018809} 0.5565044, 
2 ___|Cathepsin D P18242|CATD_MOUSE 0.06666667| 0. 0.000353] 0.00049113} 1.3912611 
3 AP-2 complex subunit beta Q9DBG3|AP2B1_MOUSE 0.07619048] 0.' 0.0004034| 0.00021048} 0.5217229 
4 AP-2 complex subunit mu P84091|AP2M1_MOUSE 0.08 1] 0.0004236] 0.00044202] 1.0434458 
5 ___|Annexin A2 P07356|ANXA2_MOUSE 0.1025641] 0. 0.0005431] 0.00068003| 1.252135) 
6 Melanocyte protein PMEL Q60696| PMEL_MOUSE 0.07575758] 0. 0.0004011} 0.00026789] 0.6678053, 
Cell membrane 
1__|Desmoplakin E9Q557|DESP_MOUSE 0.18918919} i 0.0010018] 0.00079643] 0.7950063, 
2 PDZ domain Q9Z0G0|GIPC1_MOUSE 0.36111111 i 0.0019121] 0.00110504] 0.5779085) 
3 Junction plakoglobin Q02257|PLAK_MOUSE 0.45121951] 0. 0.0023893] 0.00264133} 1.1054885) 
Identified Proteins | Accession Number | C57 SpC | 129 SpC | C57 SAF | 129 SAF | C57 NSAF | 129 NSAF | C57/129 ratio 

Endoplasmic reticulum membrane LL LS SS eS 
1 Stromal cell-derived factor 2-like protein 1 


sp|Q9ESP1|SDF2L_MOUSE 3 
2__|Nicalin sp|Q8VCM8|NCLN_MOUSE 2 
3 Translocation protein SEC62 sp|Q8BU14|SEC62_MOUSE 4 
4 __|Disco-interacting protein 2 homolog B Q3UH60|DIP2B_MOUSE 4 


o_o. 
melanosomes and Vesicles membranes 1 


if Vacuolar protein sorting-associated protein 35 Q9EQH3|VPS35_MOUSE i . 2 
2___|Angiomotin-like protein 2 Q8K371|AMOL2_MOUSE x y ; 
3 Fibrous sheath-interacting protein 2 sp|A2ARZ3|FSIP2_MOUSE 

Cell membrane Lee ee SS ee CY a eT] 


a-c, 20 μg of native cell membrane proteins were incubated with 50 μg of syngeneic (C57BL/6) or allogeneic (129S1) IgG coupled to protein G magnetic beads, and precipitated proteins were analysed by mass 
spectrometry. Shown are conversion to spectral abundance factor (SAF) and subsequent normalized spectral abundance factor (NSAF). This was based on the equation NSAF = (SpC/MW)/Σ(SpC/MW)n, where 
SpC is spectral counts, MW is protein molecular mass in kDa and n is the total number of proteins. 
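
The normalization described in this footnote can be illustrated with a minimal Python sketch. It assumes that SAF is the spectral count divided by the molecular mass in kDa and that NSAF divides each SAF by the sum of SAFs over all proteins identified in the same sample. The class and function names (ProteinHit, compute_nsaf) and the example values are hypothetical and are not taken from the table.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class ProteinHit:
    """One identified protein (hypothetical container, not from the paper)."""
    name: str
    spectral_count: int   # SpC
    mass_kda: float       # MW in kDa


def compute_nsaf(hits: List[ProteinHit]) -> Dict[str, float]:
    """Return NSAF per protein: (SpC/MW) divided by the sum of SpC/MW."""
    # Spectral abundance factor for each protein in the sample.
    safs = {hit.name: hit.spectral_count / hit.mass_kda for hit in hits}
    total = sum(safs.values())
    if total == 0:
        # No spectra at all: every normalized value is zero.
        return {name: 0.0 for name in safs}
    return {name: saf / total for name, saf in safs.items()}


if __name__ == "__main__":
    # Illustrative values only.
    sample = [
        ProteinHit("protein_A", 10, 50.0),
        ProteinHit("protein_B", 5, 25.0),
        ProteinHit("protein_C", 0, 10.0),
    ]
    for name, value in compute_nsaf(sample).items():
        print(f"{name}: NSAF = {value:.4f}")
```

Because each sample is normalized by its own sum, a protein with zero spectral counts in the syngeneic pull-down keeps an NSAF of zero there regardless of its count in the allogeneic pull-down.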
X-domain of peptide synthetases recruits 
oxygenases crucial for glycopeptide biosynthesis 


Kristina Haslinger1*, Madeleine Peschke1*, Clara Brieke1, Egle Maximowitsch1 & Max J. Cryle1 


Non-ribosomal peptide synthetase (NRPS) mega-enzyme complexes 
are modular assembly lines that are involved in the biosynthesis of 
numerous peptide metabolites independently of the ribosome’. The 
multiple interactions between catalytic domains within the NRPS 
machinery are further complemented by additional interactions with 
external enzymes, particularly focused on the final peptide matura- 
tion process. An important class of NRPS metabolites that require 
extensive external modification of the NRPS-bound peptide are the 
glycopeptide antibiotics (GPAs), which include vancomycin and 
teicoplanin”*. These clinically relevant peptide antibiotics undergo 
cytochrome P450-catalysed oxidative crosslinking of aromatic side 
chains to achieve their final, active conformation* !*. However, the 
mechanism underlying the recruitment of the cytochrome P450 oxy- 
genases to the NRPS-bound peptide was previously unknown. Here 
we show, through in vitro studies, that the X-domain’*"*, a conserved 
domain of unknown function present in the final module of all GPA 
NRPS machineries, is responsible for the recruitment of oxygenases 
to the NRPS-bound peptide to perform the essential side-chain cross- 
linking. X-ray crystallography shows that the X-domain is structurally 
related to condensation domains, but that its amino acid substitu- 
tions render it catalytically inactive. We found that the X-domain 
recruits cytochrome P450 oxygenases to the NRPS and determined 
the interface by solving the structure of a P450-X-domain complex. 
Additionally, we demonstrated that the modification of peptide pre- 
cursors by oxygenases in vitro—in particular the installation of the 
second crosslink in GPA biosynthesis—occurs only in the presence 
of the X-domain. Our results indicate that the presentation of pep- 
tidyl carrier protein (PCP)-bound substrates for oxidation in GPA 
biosynthesis requires the presence of the NRPS X-domain to ensure 
conversion of the precursor peptide into a mature aglycone, and that 
the carrier protein domain alone is not always sufficient to generate 
a competent substrate for external cytochrome P450 oxygenases. 
Few compound classes have had such a positive effect on human 
health as antibiotics’®. Concern over the rise of antibiotic resistance makes 
it essential to develop new chemotherapeutics, through the discovery 
of novel antibiotics and the renewed exploitation of existing ones'*’’. 
The GPAs are a group of compounds in current use that are highly ef- 
fective against Gram-positive bacterial infections resistant to other classes 
of antibiotics”: examples are the natural products vancomycin and teico- 
planin and semi-synthetic derivatives under development’®. The GPAs 
are complex molecules, comprising heptapeptide aglycones with a high 
percentage of non-proteinogenic amino acids, which are crosslinked 
through multiple aryl and phenolic links. The aglycones are further 
decorated through many different processes, including glycosylation 
and sulfonation”. GPAs function through the formation of a non-covalent 
complex of micromolar affinity with a peptidoglycan precursor (lipid II). 
This complex is formed via hydrogen bonds from backbone amide 
groups of the glycopeptide aglycone, and the crosslinked aromatic side 
chains are crucial in giving the aglycone the three-dimensional shape 
required for binding’. GPAs are biosynthesized without the ribosome 
by a linear NRPS (Fig. 1 and Extended Data Fig. 1)'”. In glycopeptide 


biosynthesis, the maturation of the heptapeptide requires the NRPS for 
interaction with external oxygenase enzymes—cytochromes P450 (OxyA, 
B, C, E)—that catalyse the (aryl/phenolic) crosslinking of aromatic 
amino acid side chains and provide the final, rigid aglycone structure’. 
Extensive in vivo gene disruption experiments have suggested roles for 
each of the Oxy proteins and also a specific sequence of oxidation, with 
OxyB introducing the first crosslink*-”''"”. In vitro and in vivo experi- 
ments have indicated that the peptide substrates for these P450 oxy- 
genases remain bound to the NRPS during oxidation*”"°. Our recent 
studies have concentrated on understanding the later, crucial, stages of 
glycopeptide biosynthesis, in particular the cyclization of the linear pep- 
tide through the oxidative crosslinking of aromatic side chains by the 
Oxy proteins*’”. We are now focusing on the role of a domain of un- 
known function that is present in the final module of all glycopeptide- 
type NRPS machineries. From phylogenetic analyses, this domain 
appears to be related to the condensation/epimerization (C/E)-domains 
found in NRPS biosynthesis and most closely to an “C-type condensa- 
tion domain. However, it seems to be catalytically inactive due to the 
mutation of residues of the highly conserved HHxxxDG motif essential 
for peptide bond formation and epimerization”’, and is referred to as 
the ‘X-domain’. 

As the final NRPS modules of GPAs all contain an X-domain, we 
sought to clarify the molecular function of this domain. We determined 
the crystal structure of the X-domain from the final NRPS module of 
teicoplanin biosynthesis (Xtei) to 2.9 Å resolution (Fig. 2a and Extended 
Data Table 1)'*’. The X-domain adopts a C/E-type fold, which is char- 
acterized by a V-shaped arrangement of two subdomains that both be- 
long to the chloramphenicol-acetyltransferase (CAT) fold. Crossover 
elements between the subdomains comprise the ‘floor’ of the V-shaped 
cleft (I; Fig. 2a, magenta) and the crossover ‘latch’ (II; Fig. 2a, orange). 
The X-domain also contains several insertion regions compared with 
other structurally characterized C/E-domains (Fig. 2a, red/yellow). Im- 
portantly for the function of the X-domain, the acceptor entry side of the 
X-domain is blocked by the orientation of several loop regions (Fig. 2a, 
cyan) and no tunnel through the X-domain can be observed. While the 
structural active-site motif residues H140 and D145 are maintained, the 
residues mutated from the canonical C-domain active site (HRxxxDD; 
bold text indicates mutated residues) impact on the potential catalytic 
function of this domain: the side chains of both residues R141 and D146 
project directly into the position expected to be occupied by the donor 
4'-phosphopantetheine group during peptide bond formation in an 
active C-domain (Fig. 2c and Extended Data Fig. 4a). 

Since the structure of the X-domain suggested that its function in 
glycopeptide biosynthesis was unlikely to be related to amide-bond for- 
mation, we considered its possible role in the interaction with peptide- 
modifying proteins in trans: the Oxy proteins responsible for side-chain 
cyclization of the NRPS-bound peptide*®. In vivo experiments had pre- 
viously indicated the probable association of the NRPS with the oxy- 
genase enzymes’, and the initial oxidation in vancomycin biosynthesis 
by OxyBvan has been shown to occur in vitro through the oxidation of 
substrates bound to carrier proteins”’°. However, we have recently 
demonstrated that OxyBvan displays a promiscuity in substrate selection'” 
not observed for the OxyB protein from teicoplanin”*™ biosynthesis: 
for OxyBtei the carrier protein domain is insufficient to mediate 
efficient crosslinking of the bound peptide®. We therefore investigated 
several constructs from the last teicoplanin NRPS module to assess 


Figure 1 | Structure of the teicoplanin aglycone and a schematic pathway of 
teicoplanin biosynthesis by non-ribosomal peptide synthesis. Domain labels 
for NRPS proteins (Tcp9-12): A, adenylation (selected amino acids indicated 
above the module: Hpg, 4-hydroxyphenylglycine; Dpg, 
3,5-dihydroxyphenylglycine); C, condensation; E, epimerization; T, 
thiolation/peptidyl carrier protein (PCP); TE, thioesterase; X, domain of 
unknown function. Essential P450-catalysed aglycone rigidification takes 
place through crosslinking of aromatic side chains (OxyA-C, OxyE). Each 
crosslinking reaction is performed by a specific Oxy protein, with the 
products of each Oxy protein indicated schematically; standard ring 
nomenclature is indicated on the teicoplanin aglycone in red lettering. 

Figure 2 | Structures of the X-domain and the X-domain–OxyBtei complex. 
a, Isolated X-domain: the secondary structure displays the topology typical of 
C/E-domains. b, The X-domain–OxyBtei complex: selected OxyB helices 
are labelled. c, X-domain active site showing the effects of the residues mutated 
from the canonical C-domain active-site R141 and D146. d-f, Selected amino 
acids forming important interactions in the interface are shown (OxyBtei 
F-helix D161 (d) and D162 (e), OxyBtei G-helix (f)). Colour scheme: X-domain 
amino-terminal subdomain, green; carboxy-terminal subdomain, blue; 
crossover element I, magenta, and II, orange; X-domain insertions 45/d, red, 
and B,2/B,3, yellow; loops occluding the acceptor site, cyan; X-domain 
protein surface, grey; OxyBtei, yellow; water molecules mediating interactions 
between the X-domain and OxyBtei are shown as blue spheres. Residues 
displayed as sticks are labelled, as are selected secondary structure elements; 
hydrogen bonds are indicated by dashed lines with distances (Å). 

potential interactions of larger NRPS constructs with OxyBtei: to achieve 
this we used both gel filtration co-elution experiments and native 
polyacrylamide gel electrophoresis (PAGE) mobility shift assay. Beginning 
with the largest construct, which contains three NRPS domains, 
PCP7–X–thioesterase (see Fig. 1), we observed co-elution of OxyBtei with the 
NRPS tri-domain by gel filtration (Extended Data Fig. 2a). This association 
was confirmed by the appearance of a band in the native-PAGE 
assay of lower electrophoretic mobility that contained both OxyBtei and 
the NRPS tri-domain (Extended Data Fig. 3a), as confirmed by peptide 
mass fingerprinting. Repetition of these experiments with smaller NRPS 
constructs showed that the minimal construct required for OxyBtei 
interaction from the final NRPS module of teicoplanin was the X-domain 
itself (Fig. 3 and Extended Data Figs 2a, 3a). Native-PAGE analysis of 
OxyBtei in the presence of increasing concentrations of X-domain showed 
that at equimolar concentrations OxyB is fully bound and that the appar- 
ent dissociation constant is in the low micromolar range (Extended Data 
Fig. 3c). This is within the range observed for other P450-NRPS inter- 
actions—albeit here in the absence of the actual P450 substrate’?”>*”. 
With the interaction of OxyBtei and the X-domain of teicoplanin 
demonstrated, we repeated these experiments for vancomycin-type systems 
and found comparable results (Fig. 3 and Extended Data Fig. 3a). The 
crossover experiment with alternate P450–NRPS pairs also indicated 
that OxyBvan can interact with the teicoplanin-producing NRPS domain 
(Extended Data Fig. 3a)—which agrees with the in vivo activity observed 
for oxyB cross-complementation studies ''’”. These experiments indi- 
cate that the interaction of the X-domain with OxyB is a conserved 
feature in glycopeptide biosynthesis. 

We then determined a crystal structure of the protein complex 
between the X-domain and OxyBtei to 2.5 Å resolution (Fig. 2b, Extended 
Data Figs 4b, 5a and Extended Data Table 1). The orientation of the 
complex places the upper surface of the P450 (with the haem-centred active 
site) facing towards the donor site of the X-domain cleft, an orientation 
that would allow electron transfer proteins to interact with the P450 prox- 
imal face. Comparison of the complex with the isolated P450 (ref. 8) and 
X-domain structures indicates that there is little rearrangement of either 
protein on interaction (root mean squared deviation 1.1 and 0.9 Å, 
respectively); the interaction is more of a rigid body docking and is 
mediated through approximately 20 residues on each protein. The in- 
teractions between the two proteins are mainly via hydrogen bonds and 
salt bridges; hydrophobic interactions, as are often seen in intramole- 
cular NRPS domain interactions, are limited to isolated residues in the 
X-domain-OxyB interface (Fig. 2d-f). The start of the F-helix (PRDD) 
is conserved in the Oxy proteins (Extended Data Fig. 5b) and is critical 
[Figure 3 panel legends (graphics not reproduced). a, Elution profiles: P450, 
415 nm; P450–NRPS mix (1:3), 415 nm; P450–NRPS mix (1:3), 280 nm; NRPS, 
280 nm. Molecular mass distributions: P450; P450–NRPS mix (1:3); NRPS. 
b, c, Native PAGE lanes: 1, GB1-X; 2, OxyA; 3, OxyB; 4, OxyC; 5, OxyA+GB1-X; 
6, OxyB+GB1-X; 7, OxyC+GB1-X; 8, OxyE+GB1-X.] 


Figure 3 | Interaction of the X-domain with Oxy proteins. a, Elution profiles 
of analytical size-exclusion chromatography (solid lines 415 nm, dashed 
lines 280 nm detection) and molecular masses of the individual species 
observed by multi-angle light scattering of individually sampled and co-eluted 
OxyBtei and Xtei fused to GB1 (protein G, B1 domain; 1:3 mixture). AU, 
arbitrary units. b, Native PAGE of teicoplanin Xtei-domain and Oxytei proteins 
(1-4) and 3:1 mixtures thereof (5-8; asterisks indicate the new bands of low 
electrophoretic mobility, triangle indicates the band of OxyEtei that does not 
co-migrate with GB1–Xtei). c, Native PAGE of vancomycin-producing 
Xcep-domain and Oxyvan proteins (sample order as in b with the square 
indicating the band of OxyCvan that does not co-migrate with GB1–Xcep). 
for interaction with the X-domain. The B-B, loop region (66-77), 
typically disordered in the absence of a bound substrate in P450 
oxygenases, remains unresolved, indicating that the binding of the X-domain 
does not stimulate closure of the P450 active site but rather acts to dock 
onto the P450 to present the neighbouring PCP7-peptide substrate. We 
modelled the position of the PCP7-domain based on the complex struc- 
ture of the two known P450-carrier protein complexes;”°”* this indicates 
that the PCP orientation from the skyllamycin P450-PCP complex is 
compatible with the X-P450 complex structure (Extended Data Fig. 6). 
Mutation of specific residues in each of the three X-domain interaction 
regions revealed that the loop region after helix α4 that contains 
residues R167 and R171 (which contact the conserved PRDD Oxy motif) is 
essential for the interaction (Extended Data Fig. 2b). 

Since no in vitro activity for any Oxy protein beyond the initial 
cyclization step catalysed by OxyBvan has been described so far®’®'’, we 
examined whether the X-domain interacted with the remaining 
oxygenases (OxyAtei, OxyCtei and OxyEtei) using the interaction assays 
described earlier for OxyB (Fig. 3 and Extended Data Figs 2c, 3b). These 
experiments showed that both OxyAtei and OxyCtei interact with a 
minimal X-domain construct from teicoplanin biosynthesis; the same holds 
true for vancomycin-type systems. The affinities of the teicoplanin Oxy 
enzymes for the X-domain, as determined by gel filtration, appeared 
to be lower than that of OxyB, with OxyA binding more tightly than 
OxyC. Differential affinity for the X-domain implies that additional 
selectivity for Oxy binding is based on the state of the crosslinking of 
the peptide. Thus, the role of the X-domain in the glycopeptide NRPS is 
to act as a general recruitment platform for oxygenase enzymes in gly- 
copeptide tailoring: this differs from other carrier protein—P450 sys- 
tems such as those found in aminoacyl-PCP oxidation””’ or biotin 
biosynthesis**—these rely on the carrier protein-substrate pair for sub- 
strate recognition. 

Curiously, OxyE—the oxygenase responsible for the F-O-G ring in- 
stallation in teicoplanin biosynthesis—did not display an interaction 
with the teicoplanin X-domain (Fig. 3 and Extended Data Fig. 2c). 
However, as OxyEtei catalyses a phenolic coupling step with minimal peptide 
penetration into the P450 active site (residues 1 and 3), it is plausible 
that this P450 requires the presence of the PCP7-bound peptide to main- 
tain a tight interaction to the NRPS. As the F-O-G ring is not required 
for antibiotic activity but rather moderates selectivity against different 
bacterial targets*”®, all oxygenase enzymes crucial for peptide tailoring 
in GPA biosynthesis are recruited to the NRPS-bound heptapeptide by 
the X-domain. 

The well-studied OxyB enzyme from the vancomycin system (OxyBvan) 
has been shown to catalyse the in vitro oxidation of heptapeptides with 
decreased efficiency compared with its efficient turnover of 
hexapeptides presented by the PCP-domain from module seven of the NRPS 
(PCP7), leaving the question of its natural substrate unanswered’. We 
sought to reconstitute the enzymatic activity of OxyBvan in a more 
natural context, comprising a simplified heptapeptide (Supplementary 
Fig. 1) presented by a PCP7-domain in the absence and presence of 
the X-domain. With this approach we observed a moderate increase in 
turnover yield by using the PCP7–X construct (Fig. 4 and Extended 
Data Fig. 7c). The effects of the X-domain were more pronounced for 
OxyBtei due to the comparatively low activity of OxyBtei in the in vitro 
turnover of PCP7-bound model hexapeptides:* indeed, OxyBtei is highly 
active in cyclizing heptapeptides bound to the PCP7–X constructs, 
whereas it is only able to catalyse ~25% C-O-D ring formation in 
PCP7-bound teicoplanin-like heptapeptides under the same conditions 
(Fig. 4 and Extended Data Fig. 7a). Activity assays performed with the 
PCP7–X-domain interface mutants confirm the results from the 
interaction studies (Extended Data Fig. 2b), with no activity observed for the 
X, variant (Fig. 4 and Extended Data Fig. 8). Mutant variants that retain 
the ability to bind OxyBtei retain P450 activity comparable with the 
wild-type PCP7–X protein, underpinning the importance of the X-domain 
for the OxyBtei turnover reaction. We examined whether the differences 
observed in the presence of the X-domain in these systems were due to 

Figure 4 | In vitro activity of OxyBtei and its homologues. a, Turnover of
heptapeptide substrates bound to PCP7 and PCP7-X (GB1 fusion proteins)
from the teicoplanin and chloroeremomycin NRPS with OxyB enzymes from
teicoplanin-like systems (OxyBtei, StaH) and vancomycin-like systems
(OxyBvan, CepF) expressed as a percentage of the total peptide detected.
Heptapeptide substrates loaded onto the PCP7/PCP7-X constructs are
indicated above the respective OxyB enzymes; X-domain variants X1-X3 and
X1-3 of the teicoplanin NRPS (described below the graph) show strongly
reduced activity for constructs X1 and X1-3 that possess R167A and R171A
mutations contacting the conserved PRDD Oxy motif in the wild-type protein
(average of triplicate experiments; error bars indicate standard deviation (s.d.)).
b, Electrospray ionization tandem mass spectrometry (ESI-MS/MS) analysis
of high-performance liquid chromatography (HPLC)-purified monocyclic
teicoplanin-like OxyB turnover product, demonstrating the anticipated
fragmentation for a C-O-D monocyclic peptide structure (Supplementary Fig. 5).


general differences between the biosynthesis of different GPA classes 
by investigating the effect of the X-domain on OxyB activity from two 
related glycopeptide biosynthetic machineries: those of chloroeremomycin
(cep; van-type/type-I)”’ and A47934 (sta; tei-type/type-IV)*° GPAs.
These results were even more pronounced than for the teicoplanin system, 
with effective OxyB activity seen only in the presence of the X-domain 
(Fig. 4 and Extended Data Fig. 7b, d): this indicates that the activity of 
OxyBvan is atypical and that the general mechanism for recruitment of
the Oxy enzymes in GPA biosynthesis requires the X-domain. 

To clarify further the role of the X-domain in Oxy recruitment, we 
tested the activity of OxyBtei on a hexapeptide bound to the PCP-domain
from module six of the NRPS (PCP6): these turnovers showed no activity
of OxyBtei (Extended Data Fig. 9d). This agrees with previous in vivo
experiments that report very little isolation of crosslinked hexapeptides 
compared with the final aglycone"'. Taken together, these results strongly 
suggest that crosslinked hexapeptide products only appear from minor 
shunt pathways and do not represent the normal biosynthesis route 
in vivo, which instead relies on X-domain-mediated oxygenase recruit- 
ment to the module-seven-bound heptapeptide. 

Figure 5 | Coupled in vitro activity of OxyAtei and OxyBtei. a, Turnover of
heptapeptide substrates bound to PCP7 and PCP7-X (GB1 fusion proteins)
expressed as a percentage of the total peptide detected and shown as stacked
bars (average of triplicate experiments, error bars indicate s.d.). b, ESI-MS/MS
analysis of HPLC-purified bicyclic teicoplanin-like OxyB/OxyA turnover
product, demonstrating the anticipated fragmentation for a C-O-D/D-O-E
bicyclic peptide structure (Supplementary Fig. 6).


One major hurdle in GPA biosynthesis has been the lack of in vitro 
activity of any Oxy protein beyond the C-O-D ring formation catalysed 
by OxyB. With results indicating that the X-domain was able to recruit 
all essential Oxy enzymes (OxyA-C), we investigated whether the presence
of the X-domain would support the activity of additional Oxy enzymes.
In coupled assays, the combination of OxyBtei and OxyAtei was
now able to install both C-O-D and D-O-E crosslinks in a PCP7-X-bound
heptapeptide substrate (Fig. 5 and Extended Data Fig. 9a). In agreement
with the reported in vivo oxidation order'""”, the combination of
OxyBtei and OxyCtei did not lead to the generation of a bicyclic peptide
product (Extended Data Fig. 9b). However, the lack of activity in this
assay could have been due to the limited catalytic competence of the
OxyCtei enzyme (P420 signal for the CO-complex); this also prevented
the recreation of the OxyA-C cascade (Extended Data Fig. 9c). In spite 
of this, our demonstration of in vitro OxyA activity clearly indicates 
the importance of the X-domain as a recruitment platform in the Oxy- 
catalysed installation of crosslinks in GPA biosynthesis and opens the 
door to the reconstitution of GPA aglycone biosynthesis in vitro. More 
broadly, our results show that the recruitment of enzymes to NRPS 
systems can be mediated by domains beyond carrier proteins and that 
such mechanisms may have a wider role in biosynthesis by megaen- 
zyme synthetase machineries than has been acknowledged. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 

METHODS


Cloning of constructs. For expression in E. coli, codon-optimized genes encoding 
the terminal modules of the chloroeremomycin (vancomycin-type) PCZA363.5
(ref. 29) and teicoplanin (Tcp12)**** producing non-ribosomal peptide synthetases 
as well as the C-terminal part of the second last teicoplanin producing module (Tcp11; 
sequence range 2374-4067)” were obtained from Eurofins Genomics MWG. On 
the basis of domain predictions provided by Interpro*’, combined with predicted 
secondary structure provided by Jpred*, N-terminally shortened versions of the 
proteins resulting in single (PCP7, X) and multi-domain (PCP7-X, PCP7-X-TE)
constructs were cloned. The respective domains were amplified by PCR using KOD
polymerase (Novagen) with specific primers (Supplementary Table 1) that generate
unique NcoI and XhoI restriction sites in order to clone the genes into a modified
pET vector containing the IgG-binding B1 domain of Streptococcus (GB1)*”
such that the gene is expressed in fusion with GB1 under the control of a T7 pro- 
moter. In all constructs the expressed protein includes an N-terminal hexahistidine 
tag followed by GB1 and a TEV cleavage site, together with a C-terminal Strep-II 
tag. The PCP-domain from module 6 was amplified by PCR using a specific forward
primer containing an NcoI cleavage site together with a reverse primer that
binds to the M13 terminator region of the tcp11-containing plasmid provided by 
MWG, thereby using the existing HindIII cleavage site at the 3’ end of the insert. 
The PCR fragment was cloned with the indicated restriction enzymes into a mod- 
ified pET vector containing maltose binding protein (MBP)” such that the gene was 
expressed in fusion with MBP under the control of a T7 promoter. The expressed 
PCP6-domain includes an N-terminal hexahistidine tag followed by MBP and a
TEV cleavage site. The resulting clones were sequenced using standard T7 promoter 
and terminator as well as internal sequencing primers and were used to transform 
chemically competent BL21-Gold (DE3) E. coli cells. X-domain variants of the 
GB1-PCP7-X construct were generated with the In-Fusion HD Cloning Kit (Takara
Clontech) with a combination of internal primers together with vector-specific 
primers (fwd, TTCTGAGAATCTTTATTTTCAGG; rev, TGTGGATGACTCCA 
AGCACTCTCG). In each construct a set of two mutations was introduced at the 
same time: X1, R167A, R171A; X2, E290A, D291A; and X3, E377A, R382A (amino
acids are numbered according to the presented X-domain structure with position 
3 corresponding to R1047 in the full-length Tcp12 protein according to UniProt 
numbering). Furthermore, a sextuple mutant was generated with the same primers 
(X1-3). The genes coding for the teicoplanin producing P450 enzymes were amplified
from genomic DNA and cloned into pET151D-TOPO (Life Technologies)
using a 5’-CACC sequence to ensure directional cloning. Proteins were expressed 
from the resulting vectors under the control of the T7 promoter with an N-terminal 
hexahistidine tag and V5 epitope followed by a TEV cleavage site. The genes coding 
for the vancomycin (genomic DNA)**, A47934 (genomic DNA)” and chloroeremomycin
(synthetic DNA; MWG)” producing P450 enzymes were cloned into
pET28a(+) (Novagen) using unique NdeI and HindIII restriction sites. Proteins
were expressed from the resulting vectors under the control of the T7 promoter 
with an N-terminal hexahistidine tag V5 followed by a thrombin cleavage site. Iso- 
lated plasmids were sequenced as described for NRPS constructs and used to trans- 
form chemically competent E. coli KRX cells. 

Purification of proteins. P450 enzymes were expressed and purified as described 
previously*; NRPS domains were expressed and purified in two affinity chromato- 
graphy steps (metal affinity and StrepTrap (except for MBP-PCPg)) anda final size- 
exclusion purification step as previously described for GB1-PCP7,<i°. 

Peptide synthesis. Solid-phase peptide synthesis and peptide-CoA synthesis were 
performed according to the protocol established previously'””*: peptides were as- 
sembled on Dawson resin using Fmoc chemistry” and, after release from the resin, 
transformation to the thioester using MPAA and deprotection, the peptide thioe- 
sters were converted via thioester exchange to peptide-CoA conjugates (phosphate 
buffer, pH 8.3) and isolated using solid-phase extraction methods'’. For compound 
characterization see Supplementary Figs 1-3. 

PCP modification with Sfp. PCP7-domains in the respective constructs were loaded
with the teicoplanin-like NH2-D-Hpg-D-Tyr-L-Hpg-D-Hpg-D-Hpg-L-Tyr-[D,L]-Hpg-
C(O)R (Tei7(Hpg3,7)) or the vancomycin-like NH2-D-Leu-D-Tyr-L-Asn-D-Hpg-
D-Hpg-L-Tyr-[D,L]-Hpg-C(O)R (Van7(Hpg7)) heptapeptide (see Supplementary
Fig. 1) for 1 h at 30 °C in 50 mM HEPES pH 7.0, 50 mM NaCl, 10 mM MgCl2 using
the mutant R4-4-Sfp” (1:10 ratio of Sfp to PCP) with a 2-3-fold excess of peptide-
CoA. PCP6-domain loading was performed as indicated earlier, with the teicoplanin-
like NH2-D-Hpg-D-Tyr-L-Hpg-D-Hpg-D-Hpg-L-Tyr-C(O)R (Tei6(Hpg3)) hexapeptide.
After loading, free peptide-CoA was removed by washing with low-salt
buffer 50 mM HEPES pH 7.0, 50 mM NaCl (4× 1:5 dilution) in centrifugal filter
units with a 10,000 MW cut-off (Merck Millipore). Peptidyl-PCP constructs were 
concentrated and used for turnover assays immediately to minimize hydrolysis of 
the thioester. 
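
The loading stoichiometry and the wash step above can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes each wash is an ideal 1:5 dilution followed by concentration back to the starting volume, takes 2.5-fold as a midpoint of the stated 2-3-fold peptide-CoA excess, and uses a hypothetical PCP concentration of 100 µM. The function names are not from the paper.

```python
def residual_fraction(dilution_factor: float, rounds: int) -> float:
    """Fraction of free peptide-CoA left after repeated dilute-and-concentrate washes.

    Assumes ideal mixing and that free peptide-CoA passes the filter freely.
    """
    return (1.0 / dilution_factor) ** rounds


def loading_mix(pcp_um: float, sfp_per_pcp: float = 0.1, peptide_excess: float = 2.5) -> dict:
    """Concentrations (µM) implied by a 1:10 Sfp:PCP ratio and a peptide-CoA excess."""
    return {
        "PCP": pcp_um,
        "Sfp": pcp_um * sfp_per_pcp,
        "peptide-CoA": pcp_um * peptide_excess,
    }


if __name__ == "__main__":
    # Four rounds of 1:5 dilution leave 1/625 of the unbound peptide-CoA.
    print(f"residual free peptide-CoA: {residual_fraction(5, 4):.4%}")
    # Hypothetical 100 µM PCP used purely for illustration.
    print(loading_mix(100.0))
```

Under these assumptions the four washes reduce unbound peptide-CoA 625-fold.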

Analytical size-exclusion chromatography and multi-angle light scattering. Experiments
were performed with a Superose-12 10/300 GL column (GE Healthcare)
connected to an HPLC system (Waters GmbH) with online ultraviolet (Waters),
multi-angle light scattering (MALS) (DAWN HELEOS, Wyatt Technology) and
refractive index (Waters) detection. All samples were prepared in 120 µl 50 mM
Tris pH 7.4, 100 mM NaCl with the P450 concentrations adjusted to 33.3 µM and
those of the NRPS domains to 100 µM. After an incubation of 30 min at room temperature,
100 µl of the samples were injected into the HPLC system and analysed
with ASTRA6 (Wyatt) and GraphPad Prism 5.

Native PAGE. Native PAGE was performed at 4 °C with 12% TGE acrylamide gels 
(25 mM Tris, 192 mM glycine, 1 mM EDTA, pH 9.0 (NaOH)). The samples were 
prepared in 7.5 µl TGE buffer and incubated for 30 min at room temperature (10 µM
P450 concentration, 0-30 µM NRPS concentration) before 4× loading dye was
added and the samples were applied to the pre-equilibrated gel (1 h, 300 V). The
electrophoretic separation was performed at 4 °C for 2 h at a constant voltage of
300 V. The gels were stained with InstantBlue protein stain (Expedeon). 

P450 activity assay and MS/MS analysis of products. In vitro phenolic coupling 
of the heptapeptide substrates bound to PCP7 and PCP7-X constructs from tei- 
coplanin- and vancomycin-type producing NRPS systems was performed with the 
corresponding OxyB (tei/sta/van/cep) or in coupled assays with a combination of 
the Oxy proteins (B/A: tei; B/C: tei; B/A/C: tei) immediately after the loading of PCP 
domains with heptapeptides. Additional in vitro OxyBtei turnover reactions were
performed with teicoplanin-like hexapeptide bound to the PCP6-domain. The redox
system composed of palustrisredoxin B variant A105V (PuxB) and the palustris- 
redoxin reductase (PuR)*’ was used in a 50:1:5:2 ratio (PCP:PuR:PuxB:Oxy (main- 
tained for reactions with multiple Oxy enzymes)). The reducing equivalent NADH 
was added in excess (2 mM) to start the reaction and was regenerated throughout 
the assay with a glucose/glucose oxidase (Sorachim) regeneration system (0.33% 
glucose, 9 U ml−1 enzyme). The reaction was performed for 1 h at 30 °C and subsequently
stopped by an increase in pH by adding a 23,000-fold molar excess of
CH3(NH)NH2 over peptidyl-PCP. During an incubation of 15 min at room temperature
the peptide was hydrolysed from the PCP. CH3(NH)NH2 was chosen over
(CH3NH)2 or (CH3)2N-NH2 as these reagents did not effectively cleave the peptide
from the PCP; addition of NH2NH2 or base catalysed hydrolysis did not afford
products that could be well resolved by HPLC; thus CH3(NH)NH2 was used for
these studies. The reaction was neutralized by the addition of an equal quantity of
formic acid and then further diluted 1:1 with acetonitrile (including 0.1% (v/v) formic
acid). After a 10 min centrifugation at room temperature to remove the precipitated
proteins, the supernatant was analysed by HPLC coupled to MS (gradient:
5% (v/v) acetonitrile for 4 min, 5-15% (v/v) acetonitrile in 0.5 min, 15-50% (v/v)
acetonitrile in 21 min, to 95% for 11 min; solvents including 0.1% (v/v) formic
acid; column: XBridge BEH 300 Prep C18, 5 µm, 4.6 × 250 mm, Waters). All experiments
were performed in triplicate. MS analysis was performed in negative
mode. Complex series of products obtained are due to the combination of methyl- 
hydrazide regioisomers and racemization of the C-terminal residue; other effects 
can include the racemization of residues within crosslinks as has been reported 
previously and the observation of masses originating from lower molecular 
weight species (see also Extended Data Figs 7-9). Turnover yields were calculated 
by integrating the peak areas of the product species and correlating them to the 
total peak area of peptide species detected. Products of turnover were isolated by 
HPLC and then subjected to MS/MS analysis using a maXis ultra-high resolution 
time-of-flight (TOF) mass spectrometer (Bruker). Fragmentation patterns were 
identified by comparison to those previously reported (R. Süssmuth, personal
communication)’. 
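
The turnover-yield calculation described above (product peak area as a share of all peptide peak area, averaged over triplicates) can be sketched as follows. This is a minimal illustration, not the authors' analysis script: the peak areas are invented, the function names are hypothetical, and the use of the sample standard deviation is an assumption, since the text states only that error bars show s.d.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple


def turnover_yield(product_areas: Sequence[float], all_peptide_areas: Sequence[float]) -> float:
    """Percentage of detected peptide peak area that belongs to product species."""
    total = sum(all_peptide_areas)
    if total <= 0:
        raise ValueError("no peptide signal detected")
    return 100.0 * sum(product_areas) / total


def summarise(replicate_yields: Sequence[float]) -> Tuple[float, float]:
    """Mean and sample standard deviation across replicate assays."""
    return mean(replicate_yields), stdev(replicate_yields)


if __name__ == "__main__":
    # Invented integrated peak areas for three replicates: (product peaks, all peptide peaks).
    replicates = [
        ([30.0, 20.0], [30.0, 20.0, 50.0]),
        ([28.0, 24.0], [28.0, 24.0, 48.0]),
        ([26.0, 22.0], [26.0, 22.0, 52.0]),
    ]
    yields = [turnover_yield(p, a) for p, a in replicates]
    print(yields, summarise(yields))
```

Summing all product peaks before dividing matters here because, as noted above, each product appears as several peaks (regioisomers and diastereomers).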

Protein crystallization and structural solution. Crystallization of the X-domain, 
both in isolation and in complex with OxyBtei, was performed at 293 K in hanging
drops using 100 mM Bis-Tris pH 6.5, 200 mM (NH4)2SO4 and 20% PEG 3350 as
the precipitant solution. Crystals were cryoprotected in reservoir solution supple- 
mented with 20% ethylene glycol, flash cooled and kept below 100 K. Data collec- 
tion was performed at the Swiss Light Source (Paul Scherrer Institute) beamline 
X10SA (λ = 0.9797 Å (X-domain) and 0.9780 Å (X/OxyB); for details see Extended
Data Table 1). Data processing was conducted with XDS”; phasing via molecular 
replacement in Phaser*° (using Protein Data Bank accessions 2JGP and 4TVF); 
model building in COOT" refinement and quality control performed using Phenix” 
and MOLPROBITY™*. 97.3% of all residues in the X-domain structure are in the 
most favoured regions of the Ramachandran plot and 2.7% in the allowed; 0.3% of 
the side chains adopt less probable rotamer conformations and the clash score is 
5.4. In the structure of the complex, 96.2% of all residues are in the most favoured 
regions of the Ramachandran plot, 3.7% in the allowed and 0.1% in the disallowed; 
0.9% of the side chains adopt less probable rotamer conformations and the clash 
score is 4.0. Superposition was performed using SSM“ in COOT and figures were 
generated with PyMOL (http://www.pymol.org). Structural comparison was per- 
formed using Dali**; multiple sequence alignments were generated with ClustalW2 
(ref. 46), and interface residues were identified by PISA’. 

Extended Data Figure 1 | Overview and graphical summary of this study.
a, b, Final stages of teicoplanin aglycone biosynthesis (a) and vancomycin-type 
aglycone biosynthesis (b). c, d, In vitro binding studies demonstrated that 
the minimal interaction interface of OxyA-C is the conserved X-domain from 
the final module of the teicoplanin non-ribosomal peptide synthetase (c) and 
that the OxyBtei and OxyAtei enzymes required for peptide maturation are only
active against peptide-bound PCP7-substrates that also contain this X-domain
(d). These results support the hypothesis that in vivo aglycone formation 
occurs through the interaction of Oxy enzymes with module seven of the 
teicoplanin non-ribosomal peptide synthetase that then act upon the NRPS- 
bound linear heptapeptide precursor. 

Extended Data Figure 2 | Interaction of Oxy enzymes with NRPS domains
studied by analytical size-exclusion chromatography and multi-angle light
scattering. a, b, Profiles of OxyBtei (red) shown with the profiles of the
corresponding NRPS domain constructs and variants (orange) and mixtures of
the binding partners in a 1:3 ratio (blue). c, Profiles of OxyAtei, OxyCtei and OxyEtei
(red) shown with the profiles of the corresponding GB1-X constructs (orange)
and mixtures of the binding partners in a 1:3 ratio (blue). Solid lines
represent detection at 415 nm and dashed lines detection at 280 nm.

Extended Data Figure 3 | Interaction of Oxy enzymes with NRPS domains
studied by native-PAGE. a, Top, overview of electrophoretic migration
behaviour of OxyBvan and OxyBtei in the absence/presence of different NRPS
domain constructs (GB1-PCP7, -PCP7-X, -PCP7-X-TE, -X; 1:3 molar
ratio). Bottom, interaction studies of OxyBtei with NRPScep proteins and
OxyBvan with NRPStei proteins. b, Overview of electrophoretic migration
behaviour of OxyAtei, Btei and Ctei in the absence/presence of GB1-Xtei or
GB1-PCP7-X-TEtei (1:3). c, Titration of OxyBtei (10 µM) with GB1-Xtei (0 to
30 µM) leading to the appearance of a new band with low electrophoretic
mobility and the disappearance of free OxyBtei at equimolar concentrations.
The bands corresponding to an Oxy/X complex are marked with an asterisk,
those corresponding to an Oxy/PCP7-X-TE complex with a triangle and
those corresponding to Oxy/PCP7-X with a square.


©2015 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 4 | X-domain active site in comparison to TycC5-6.
a, b, Comparisons of the X-domain both in isolation (green/orange; a) and in
complex with OxyBtei (green/orange; b) to the active site of TycC5-6 (grey,
solved with a bound sulphate ion): the effect of the altered X-domain active-site
motif is to block the active site. Selected residues are shown as sticks. Hydrogen
bonds are indicated by dashed black/grey lines for the X-domain and in red
for the TycC5-6/sulphate distance.

Extended Data Figure 5 | Structure of OxyBtei and multiple sequence
alignment of the Oxy enzymes. a, Structure of OxyBtei determined in complex
with the X-domain. Secondary structure elements are labelled and coloured
grey with the following exceptions: B-helix and loop region (magenta), D-helix
(cherry red), E-helix (blue), F-helix (orange), G-helix (yellow), I-helix (cyan),
β1-sheet (green), β2-sheet (purple), β3-sheet (orange), β4-sheet (yellow),
haem shown as sticks (C atoms, red; N atoms, blue; O atoms, orange) with the
haem iron shown as a red sphere. b, Sequence alignment of Oxy proteins from
Actinoplanes teichomyceticus teicoplanin gene cluster***. Secondary structure
and colour scheme same as a; catalytic residues are shown in orange;
residues important for X-domain interaction are indicated in the three boxed
regions, are in bold and highlighted in yellow for OxyBtei. Colours used for
the corresponding residues from OxyAtei, OxyCtei and OxyEtei indicate
agreement with the consensus sequence (green, match; blue, comparable
interaction; red, potential mismatch).

Extended Data Figure 6 | Model of the PCP7-X-OxyBtei complex in two
possible arrangements. a-d, Possible PCP position determined from the
alignment of the P450sky-PCP7 complex trapped using a covalent azole
inhibitor with the OxyBtei-X-domain complex (a, b) and the structure of the
P450BioI-ACP complex (c, d). The X-domain is shown in grey, OxyBtei is shown
in yellow, the haem of OxyBtei is shown in red sticks. a, b, The PCP7 (sky) is
shown in magenta and the azole inhibitor covalently bound to the PCP is shown
as pink sticks. c, d, The ACP is shown in blue and the phosphopantetheine-
bound fatty acid is shown as blue sticks. The distance between the C terminus of
the PCP/ACP and the N terminus of the X-domain is shown in a and c.




Extended Data Figure 7 | HPLC-MS analysis of P450-catalysed peptide
monocyclization. a-d, Representative HPLC-MS chromatograms of in vitro
OxyBtei (a), StaH (b), OxyBvan (c), CepF (d) turnover reactions with
heptapeptide Tei7(Hpg3,7) or Van7(Hpg7) bound to GB1-PCP7-X (left) and
GB1-PCP7 (right). Ions corresponding to the singly charged, linear
(hydrolysed m/z 1,088, methylhydrazide m/z 1,116) and crosslinked
monocyclic (hydrolysed m/z 1,086, methylhydrazide m/z 1,114) teicoplanin-
like peptide and the linear (hydrolysed m/z 1,017.5, methylhydrazide m/z
1,045.5) and crosslinked monocyclic (hydrolysed m/z 1,015.5,
methylhydrazide m/z 1,043.5) vancomycin-like peptide were recorded using single-
ion monitoring (SIM) in negative mode. Major peaks for each mass represent
diastereomers due to racemization of the C-terminal Hpg residue and two
regioisomers derived from methylhydrazine cleavage; smaller peaks can be
caused by overlapping mass signal detection with ions from lower molecular
weight species; additionally, racemization of such crosslinked products has
also been previously observed”.
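
The ions listed in this caption follow two regular offsets, which makes a quick consistency check possible: a phenolic crosslink removes two hydrogens (m/z lower by 2), and a methylhydrazide sits a nominal 28 above the corresponding hydrolysed acid (methylhydrazine, 46, in place of water, 18). The sketch below only re-tabulates the values quoted above; the dictionary layout and function name are illustrative.

```python
# m/z values as quoted in the caption (singly charged ions, negative mode).
REPORTED_MZ = {
    "teicoplanin-like": {
        ("linear", "hydrolysed"): 1088.0,
        ("linear", "methylhydrazide"): 1116.0,
        ("monocyclic", "hydrolysed"): 1086.0,
        ("monocyclic", "methylhydrazide"): 1114.0,
    },
    "vancomycin-like": {
        ("linear", "hydrolysed"): 1017.5,
        ("linear", "methylhydrazide"): 1045.5,
        ("monocyclic", "hydrolysed"): 1015.5,
        ("monocyclic", "methylhydrazide"): 1043.5,
    },
}


def mass_shifts(peptide: str):
    """Return (crosslink shifts, cleavage-reagent shifts) for one peptide series."""
    mz = REPORTED_MZ[peptide]
    crosslink = {
        form: mz[("monocyclic", form)] - mz[("linear", form)]
        for form in ("hydrolysed", "methylhydrazide")
    }
    cleavage = {
        state: mz[(state, "methylhydrazide")] - mz[(state, "hydrolysed")]
        for state in ("linear", "monocyclic")
    }
    return crosslink, cleavage


if __name__ == "__main__":
    for name in REPORTED_MZ:
        print(name, mass_shifts(name))
```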

Extended Data Figure 8 | HPLC-MS analysis of OxyBtei-catalysed
cyclization of peptides presented by mutant variants of PCP7-X.
a-d, Representative HPLC-MS chromatograms of in vitro OxyBtei turnover
reactions with substrate heptapeptide Tei7(Hpg3,7) bound to the different
GB1-PCP7-X mutants (X1: R167A, R171A (a); X2: E290A, D291A (b); X3:
E377A, R382A (c); X1-3: R167A, R171A, E290A, D291A, E377A, R382A (d)).
Occurrence of ions corresponding to the singly charged, linear peptide
(hydrolysed m/z 1,088, methylhydrazide m/z 1,116) and the crosslinked
monocyclic product (hydrolysed m/z 1,086, methylhydrazide m/z 1,114) were
recorded using SIM in negative mode. The origins of the multiple peaks
observed for each mass are described in Extended Data Fig. 7.

Extended Data Figure 9 | HPLC-MS analysis of P450-catalysed peptide
bicyclization. a-d, Representative HPLC-MS chromatograms of coupled
in vitro OxyBtei plus OxyAtei (a), OxyCtei (b), or OxyAtei/OxyCtei (c) turnover
b, c) and GB1-PCP7 (a, right); or OxyBtei alone with hexapeptide Tei6(Hpg3)
bound to MBP-PCP6 (d). Occurrence of ions corresponding to the
peaks observed for each mass are described in Extended Data Fig. 7.

Extended Data Table 1 | Crystallographic data and refinement statistics for the isolated X-domain and the X-domain in complex with OxyBtei

                        X-domain*             X-domain/OxyBtei complex*
Data collection
Space group             P6,                   P2122,
Cell dimensions
  a, b, c (Å)           135.0, 135.0, 54.4    87.3, 94.0, 125.3
  α, β, γ (°)           90, 90, 120           90, 90, 90
Resolution (Å)          48.0-2.9              50.0-2.5
Rmerge†                 0.12 (0.37)           0.09 (0.49)
I/σI†                   24.3 (9.1)            15.0 (3.8)
Completeness (%)†       99.9 (99.9)           99.9 (99.9)
Redundancy              20.5                  5.9
Refinement
Resolution (Å)          44.2-2.9              47.0-2.5
No. reflections         12754                 36330
Rwork / Rfree           19.5 / 25.4           17.1 / 22.7
No. atoms
  X-domain              3519                  3573
  OxyBtei               -                     2893
  Heme                  -                     43
  Sulfate               -                     16
  Ethylene glycol       -                     4
  Water                 18                    253
B-factors
  X-domain              30.7                  41.4
  OxyBtei               -                     46.3
  Heme                  -                     36.0
  Sulfate               -                     66.3
  Ethylene glycol       -                     47.2
  Water                 24.9                  41.2
R.m.s. deviations
  Bond lengths (Å)      0.002                 0.008
  Bond angles (°)       0.6                   1.2

* Data were collected from one crystal.
† Data for the highest resolution shell are shown in parentheses.

CORRECTIONS & AMENDMENTS


RETRACTION 
doi:10.1038/nature14421 


Retraction: Histone methylation by 
the Drosophila epigenetic 
transcriptional regulator Ash1 


Christian Beisel, Axel Imhof, Jaime Greene, Elisabeth Kremmer 
& Frank Sauer 


Nature 419, 857-862 (2002); doi:10.1038/nature01126 


The authors and the University of California Riverside wish to retract 
this Letter owing to inappropriate image manipulation in the pub- 
lished figures. The figure panels affected are Figure 1b, d, Figure 2b, e, 
Figure 3a and Figure 4d. Nature has not received a response from 
Frank Sauer to approve this retraction. 

ILLUSTRATION BY THE PROJECT TWINS


THE TROUBLE WITH 
REFERENCE ROT 


Computer scientists are trying to shore up broken links in the scholarly literature. 


BY JEFFREY M. PERKEL 


The scholarly literature is meant to be a
permanent record of science. So it is an
embarrassing state of affairs that many 
of the web references in research papers are 
broken: click on them, and there’s a fair chance 
they will point nowhere or to a site that may 
have altered since the paper referred to it. 
Herbert Van de Sompel, an informa- 
tion scientist at the Los Alamos National 
Laboratory Research Library in New Mexico, 
quantified the alarming extent of this ‘link rot’
and ‘content drift’ (together, ‘reference rot’) in 
a paper published last December (M. Klein 
et al. PLoS ONE 9, e115253; 2014). With a 
group of researchers under the auspices of 
the Hiberlink project (http://hiberlink.org), 
he analysed more than 1 million ‘web-at- 
large’ links (defined as those beginning with 
‘http://’ that point to sites other than research 


articles) in some 3.5 million articles published 
between 1997 and 2012. The Hiberlink team 
found that in articles from 2012, 13% of hyper- 
links in arXiv papers and 22% of hyperlinks in 
papers from Elsevier journals were rotten (the 
proportion rises in older articles), and over- 
all some 75% of links were not cached on any 
Internet archiving site within two weeks of the 
article’s publication date, meaning their con- 
tent might no longer reflect the citing author's 
original intent — although the reader may not 
know this. 

Hyperlinks to web-at-large content were 
present in only one-quarter of the 2012 schol- 
arly articles, but some four-fifths of those 
papers that did contain a link suffered from 
reference rot, the team found — that is, at least 
one reference to web-at-large content was 
either dead or not archived. Van de Sompel 
terms the situation “rather dramatic”. Because 
the content of servers can change, or they can
‘go dark’ or change hands, researchers following
up links to online data sets, software or other 
resources might have nowhere to turn. “You've 
lost a trace to the evidence that was used in the 
research,” he says.
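
The two failure modes defined above can be told apart mechanically once a link has been checked. The sketch below is a hypothetical illustration, not the Hiberlink team's method: it assumes each check records whether the link still resolves, plus a content digest taken when the link was cited and one taken now. All names are invented for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LinkCheck:
    resolves: bool                  # does the URL still return a page?
    cited_digest: Optional[str]     # hash of the page when it was cited, if archived
    current_digest: Optional[str]   # hash of the page as it is today


def classify(check: LinkCheck) -> str:
    """Label a checked link using the article's two categories of reference rot."""
    if not check.resolves:
        return "link rot"
    if check.cited_digest is None or check.current_digest is None:
        # Without an archived snapshot there is nothing to compare against.
        return "unknown"
    if check.cited_digest != check.current_digest:
        return "content drift"
    return "intact"


if __name__ == "__main__":
    print(classify(LinkCheck(False, None, None)))
    print(classify(LinkCheck(True, "abc", "def")))
    print(classify(LinkCheck(True, "abc", "abc")))
```

The "unknown" branch reflects the point made above: without a snapshot from around the time of citation, drift cannot be detected at all.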


SNAPSHOTS OF THE WEB 

Fortunately, online archiving services, such 
as the Internet Archive, make it possible for 
researchers to store permanent copies of a 
web page as they see it when preparing their 
manuscripts — a practice Van de Sompel rec- 
ommends. He urges researchers to include 
their cached link and its creation date in their 
manuscripts (or for publishers to take a snap- 
shot of referenced material when articles are 
submitted). The Harvard Law School Library 
in Cambridge, Massachusetts, has developed
a web-archiving service called Perma.cc
(https://perma.cc): enter a hyperlink here
and the site spits back a new hyperlink for a 
page that contains links to both the original
web source and an archived version.
Van de Sompel and others have in the 
past few weeks rolled out a complementary 
approach. It relies on a service that Van de 
Sompel has co-developed called Memento, 
which he dubs “time travel for the web”. The 
Memento infrastructure provides a single 
interface for myriad online archives, allow- 
ing users access to all of the saved versions of 
a given web page. This infrastructure could 
potentially allow access to web-at-large links 
in any scholarly article, even if the linked sites 
go down. Publishers would have to incorpo- 
rate a small piece of extra computer code in 
their articles, and the standard single weblinks 
would have to be replaced with three pieces 
of information — the live link, a cached link 
and its creation date — all wrapped in Van de 
Sompel’s proposed machine-readable tags. 
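
As a rough sketch of what such a three-part link might look like in an article's HTML, the function below wraps a live link, a cached link and a snapshot date into one anchor element. The attribute names `data-versionurl` and `data-versiondate` and all URLs are assumptions chosen for illustration; the article does not spell out the tag format.

```python
from datetime import date
from html import escape


def robust_link(live_url: str, cached_url: str, snapshot_date: date, text: str) -> str:
    """Build an anchor carrying the live link, the cached link and its creation date.

    Attribute names are illustrative placeholders, not a confirmed specification.
    """
    return (
        f'<a href="{escape(live_url)}" '
        f'data-versionurl="{escape(cached_url)}" '
        f'data-versiondate="{snapshot_date.isoformat()}">'
        f"{escape(text)}</a>"
    )


if __name__ == "__main__":
    # Placeholder URLs; a real article would use the cited page and its archived snapshot.
    print(
        robust_link(
            "http://example.org/dataset",
            "http://archive.example.org/2015/dataset",
            date(2015, 5, 1),
            "data set",
        )
    )
```

A reader's browser could still follow `href` as usual, while the small piece of publisher code mentioned above could offer the cached copy when the live page fails.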


STORAGE BLOCK 

Van de Sompel says that he is “unbelievably 
enthusiastic” about the team’s approach. But 
the solution depends on the cooperation of 
authors and publishers — who may be disin- 
clined to help. Another issue is that web-page 
owners who hold copyright over content 
can demand that archives remove copies of 
it. They can also disallow archiving of their 
sites by including a file or line of code that 
prevents computer programs from ‘crawling’ 
over or capturing content — and many do. If 
Perma.cc, for instance, encounters such an 
exclusion code, it preserves the content in a
‘dark archive’; to access a web page in a dark 
archive, the reader must contact a library 
participating in the Perma.cc project and 
request to see the site. 
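
One common form of the exclusion file described above is a site's robots.txt, and Python's standard library can evaluate it. The sketch below assumes that convention (the article does not name it) and uses an invented crawler name and placeholder URLs; it shows how an archiving tool might decide between a public and a 'dark' copy.

```python
from urllib.robotparser import RobotFileParser


def may_archive(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits this agent to fetch the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Hypothetical robots.txt that blocks all crawlers from one directory.
    rules = "User-agent: *\nDisallow: /private/\n"
    for page in ("http://example.org/private/data.html", "http://example.org/public/data.html"):
        verdict = "public archive" if may_archive(rules, "example-archiver", page) else "dark archive"
        print(page, "->", verdict)
```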

Scholarly articles that are behind a 
paywall routinely exclude such crawling, 
too — although publishers have introduced 
the DOI system to ensure that scientists can 
confidently cite a persistent hyperlink to the 
right version of an online research article, 
even if the publisher changes its local web 
addresses. (In January, however, the system 
that redirects DOI links went down, showing 
that it is not immune to failure.) Publishing 
companies also guard against link rot by auto- 
matically preserving articles in archives; the 
articles can be released if the company folds. 

But not all companies are archiving, says 
David Rosenthal, a staff member at the 
library of Stanford University in California; 
analysis of data from a monitoring service 
called The Keepers Registry shows that “at 
most 50% of articles are preserved”, Rosenthal
writes on his blog (go.nature.com/jrwqo4).
So for both web-at-large hyperlinks
and scholarly articles, the Memento team’s
mission to solve reference rot may be “excessively
optimistic”, he says.


Jeffrey M. Perkel is a writer based in 
Pocatello, Idaho. 

‘Living figures’
make their debut 


Published chart integrates data from outside scientists. 


BY DALMEET SINGH CHAWLA 


In July last year, neurobiologist Björn Brembs published a paper about how
fruit flies walk. Nine months on, his paper
looks different: another group has fed its data
into the article, altering one of the figures.

The update — to figure 4 — marks the
debut of what the paper’s London-based
publisher, Faculty of 1000 (F1000), is calling
a living figure, a concept that it hopes will
catch on in other articles.

Brembs, at the University of Regensburg in
Germany, says that three other groups have
so far agreed to add their data, using software
he wrote that automatically redraws the figure
as new data come in.

His article, written with Julien Colomb,
chief executive of the start-up firm
Drososhare in Berlin, finds behavioural
differences within a strain of fruit fly: the
Canton Special, or CS strain (J. Colomb
and B. Brembs F1000Research 3, 176; 2014).
Although there are substrains, researchers
usually regard CS flies as so similar that they
do not distinguish between the substrains in
their analyses, but Brembs and Colomb report
that the flies exhibit three types of walking
behaviour. This might betoken other differences
in behaviour and therefore confound
experiments in which CS flies are used as a
control group, he says.

Having sequenced the genomes of the flies,
Brembs thinks that the behaviours have a
genetic origin and will not be explained away
by environmental variations between labs.
The addition of data by other labs could help
to test whether his theory is correct.


ITERATIVE PUBLISHING 
The living figure concept fits within a central 
tenet of F1000’s publishing philosophy, that 
papers can be continually updated. The 
online-only open-access site publishes arti- 
cles immediately with the status ‘Awaiting 
Peer Review’, then invites scientists to review
them. Authors can then update their articles 
with new versions. The process is like adding 
pieces of paper to the top of an existing pile, 
the publisher says. 

Allowing outside researchers to post 
their data into a paper simply takes the 
idea a step further, says Rebecca Lawrence, 
managing director of the publishing platform
F1000Research. “The idea is that it better
mirrors the way science is conducted,” she says. 
Other laboratories’ information confirms or 
challenges the published research in an incre- 
mental process. In addition to updating work, 
living figures may allow systematic reviews to be
updated rather than published afresh each
time, Lawrence adds. They should also help
to address the issue of lack of reproducibility,
she argues, because it provides a way for
laboratories to release confirmatory data, which
can be hard to get published.

Of course, by adding data to someone else's 
article, scientists are giving up the chance to 
publish a paper of their own — a potential 
hurdle, because publications are the lifeblood 
of reputations in academia. But Gregg Roman 
from the University of Houston, Texas, the first 
outside author to add data to Brembs’s paper 
after publication, says that he accepts that. 
“We're sacrificing a bit of recognition,” he says, 
but “it’s a more accessible way for scientists to 
get the answer than if we publish separately”. 

New contributors’ names do, however, 
appear in the legend of updated figures; and 
the updated data set and paper get their own 
DOIs. Alternatively, contributors can choose
to gain a formal publication by submitting 
what F1000Research calls a Data Note that 
links to the original updated paper. 
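
The living-figure workflow described above can be sketched as a small data structure: each contributing group adds measurements, the summary behind the figure is recomputed, and the legend lists every contributor. This is an illustrative Python sketch only. The class, field names and numbers are hypothetical, it is not the software Brembs wrote, and it computes a summary rather than drawing a plot.

```python
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class LivingFigure:
    """Accumulates measurements per contributor and rebuilds the summary on demand."""
    title: str
    datasets: dict[str, list[float]] = field(default_factory=dict)

    def add_dataset(self, contributor: str, values: list[float]) -> None:
        """Merge a new contribution into the figure's underlying data."""
        if not values:
            raise ValueError("a contribution must contain at least one measurement")
        self.datasets.setdefault(contributor, []).extend(values)

    def summary(self) -> dict[str, float]:
        """Recompute the per-contributor mean that a redrawn figure would plot."""
        return {name: mean(vals) for name, vals in self.datasets.items()}

    def legend(self) -> str:
        """List contributors in the order they added data."""
        return f"{self.title}. Data from: " + ", ".join(self.datasets)


# Hypothetical example values, not data from the paper.
figure = LivingFigure("Walking speed by substrain")
figure.add_dataset("Original lab", [1.0, 1.2, 1.4])
figure.add_dataset("Second lab", [0.9, 1.1])
```

Because the summary and legend are derived from the stored data each time, adding a data set is the only step a new contributor has to perform.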

If the new contributors’ methods or results 
differ significantly from the original paper’s, 
then they can publish a Research Note, 
Lawrence says. They can also request that 
the original authors update their article. An 
updated paper would be peer reviewed again. 

Lawrence says that many research groups 
have shown interest in publishing living figures. 
And the concept could work with traditional 
pre-publication review, too, notes Peter Binfield, 
co-founder of the open-access journal PeerJ. “As
long as the full version history of the article is 
available, and it’s clear which version of the arti- 
cle was reviewed, and in what way, it should be 
possible to publish updates,” he says. 

As for Brembs’s work, Roman says that his 
data seem to support the general trend, but with 
smaller differences between the flies. The question
may be resolved only as figure 4 evolves.


RECOGNITION




Build a reputation 


To get respect in a field, scientists need to consider not just
their work, but also their interactions with others. 


BY CHRIS WOOLSTON 


Less than a decade after receiving
her undergraduate degree in biology,
Holly Bik has transformed herself.
When she started her PhD, she was
an aspiring marine biologist with a deep
interest in nematode worms. Today, she is a 
highly regarded interdisciplinary computa- 
tional and evolutionary biologist who travels 
the world to give talks on topics that range 
from use of social media to what she dubs 
‘ecophylometamicrobiomics’ — the identification
of eukaryotic microbes in the environment
through sequencing. Now at the
University of Birmingham, UK, she has led 
the development of the data-visualization plat- 
form Phinch and is actively involved in three 
working groups tackling issues as diverse as the 
evolution of indoor microbial communities 
and the biodiversity of the deep sea. 

It is all a big leap from worms. How did 
she become such a sought-after figure in the 
science community? The key to property is 
said to be location, location, location; in sci- 
ence, it’s all about reputation, reputation, rep- 
utation. “I’m trying to cultivate a reputation 
as an interdisciplinary researcher,” says Bik.
“Marine biology, computer programming, 
genomics — I want people to think of me as a
potential collaborator.” 

If science were truly a double-blind enter- 
prise, generic researchers X, Y and Z would 
compete for citations, grants, invited talks and 
promotions solely on the basis of their accom- 
plishments and aptitude. In the real world, 
scientists have names, and those names come 
with baggage, both positive and negative. In 
an increasingly competitive scientific environ- 
ment, a reputation may matter more than ever, 
says Philip Bourne, associate director for data 
science at the US National Institutes of Health 
(NIH) in Bethesda, Maryland. “The degree of 
separation between any two scientists is relatively
small,” Bourne says. “If you’re colossally
brilliant, you can be a jerk and still have a good
reputation. But if you're a mere mortal, the way 
you treat science and the people around you 
will come back on you.” 

Savvy young researchers make the effort 
to define themselves. As they move through 
their professional lives, they work to ensure 
that their name is an asset to their develop- 
ment — and not a deficit or a neutral factor. 
For early-career scientists, Bourne says, repu- 
tations are based largely on the quality of their 
work, which should always be a top priority. 
But researchers also need to keep several key 
issues at the forefront of their minds: their 
relationships with fellow scientists, their pres- 
ence in the broader scientific community and 
their willingness to do what it takes to protect 
and promote their personal brand. Conducted 
appropriately, peer review, social-media use 
and collaboration can all help to create and 
build a positive identity. “Science is like liv- 
ing in a small village,” Bourne says. “You can't 
escape what's known about you.” 

Once a reputation reaches critical mass, it
can start to matter as much as or more than
the strength of ideas, adds Alexander Petersen, 
a mathematician and economist at the IMT 
Institute for Advanced Studies Lucca in Italy. 
“Reputation affects all areas of science,” he 
says. “You can't know everything about every 
researcher, so name recognition becomes an 
important discrimination tool.”

Certainly, broad measures of impact such as 
h-indexes form an important part of a researcher’s
reputation. But such raw numbers do not
tell the whole story of a scientist, says Ginny 
Barbour, chair of the Committee on Publica- 
tion Ethics in Brisbane, Australia, and medicine 
editorial director of the Public Library of Science.
In 2011, she and Bourne wrote an editorial
that set out ten important steps to building
and maintaining a scientific reputation. Rack- 
ing up as many citations as possible was not on 
the list. Instead, they encouraged researchers 
to commit themselves to integrity, attention to 
detail and productive interpersonal relationships
(see ‘Tricks of the trade’).


TEAM PLAYER 
Some steps — such as putting in the time and 
effort needed to do good reviews of grant 
applications and papers — may not pay obvi- 
ous dividends at first, Barbour says, but the 
rewards build over time. “Good academics 
take the job of reviewing very seriously,” she
says. “I’ve seen reviews come in at midnight 
on a weekend. Or on Easter Sunday.” Even 
when reviews are supposedly anonymous, 
the reviewers can often be identified by tone 
and style, she says. And if a reviewer makes 
the effort to carefully weigh up the science and 
provide thoughtful and constructive feedback, 
the word gets around. “Any academic can tell 
you who a good reviewer is,” she says.

At first glance, having a reputation as a
shoddy reviewer might not seem like
much of a burden to a career. After all, as
Barbour notes, poor reviewers are likely
to get fewer requests to judge grant applications,
so they have more time to devote
to their actual work. But those researchers
can also develop a reputation for not being
a team player, a label that can be debilitating 
in the increasingly collaborative science arena. 
In a similar vein, Barbour says, it is important
to show concern and respect for students 
and for other people in the lab, not because 
students can immediately help a career, but 
because word will spread if a scientist seems 
too demanding, stand-offish or — worse — 
eager to accept credit for someone else’s work. 
“It benefits everyone to be professional and 
supportive,” she says. 

Barbour says that if she had a chance to 
TRICKS OF THE TRADE


How to build a positive reputation 


In 2011, an editorial in PLoS Computational 
Biology offered “ten simple rules for
building and maintaining a scientific 
reputation”. Here is a brief synopsis of the 
advice. 

• Learn to accept criticism gracefully. As
tempting as it might be to fire off a testy 
e-mail after a real or perceived slight — do 
not do it. Take the time to respond 
thoughtfully and professionally. 

• Do not ignore people below you on the
career ladder. It might not be obvious now, 
but responding to e-mails and phone calls 
from students can do wonders for your 
reputation. If you help them to feel valuable, 
they will value you in return. 

• Take publishing seriously. No matter
where you are on the list of authors, 
diligently check everything that carries your 
name. And you should never accept — or 
offer — author credits that are not deserved. 
• Declare conflicts of interest. Everybody
has conflicts, so it is best to stay open and 
transparent. And if you are ever asked to 


update her editorial on building a reputation, 
she would add one more item: cultivate a posi- 
tive online presence. Some researchers have 
already received that message. “Social media 
has been a big part of building my own reputation,”
Bik says. Twitter, she says, has been an
excellent forum for burnishing her personal 
brand as a multidisciplinary, collaborative 
researcher. Every time she tweets about her 
own work or her take on other papers, her 
reputation spreads. “I’m regularly invited to 
speak at conferences and give departmen- 
tal seminars at different institutes. Many of 
these invitations happen because students 
and other researchers know about my work 
through Twitter,” she says. Bik, whose handle
is @Hollybik, has tweeted more than 11,000 
times since September 2010 and has more 
than 5,000 followers, including scientists from 
a wide variety of fields. 

Bik explains that she has a strategy for 
promoting her latest publications on social 
media. “I’ll tweet throughout the week leading
up to the publication,” she says. “I’ll also
write blog posts and work with the university 
press office to get coverage.” This approach not 
only increases the visibility of her work, but 
also ensures that it is promoted in a fair, accu- 
rate manner. Her efforts are in keeping with a 
growing sense in the scientific community that 
researchers need to be actively involved with 
press offices to ensure accurate coverage of 
their work. 

Bik is surprised by the number of research- 
ers who seem reluctant to embrace social 
review work from a competitor, you should
strongly consider opting out. 

• Support the scientific community.

Review papers, share data, help to organize 
meetings — in other words, give back to the 
community as much as you take. 

• Do not overcommit. Some scientists fill
their slates past the point of reason. With 

so much to do, few things actually get done 
well or on time. And that is the sort of thing 
people notice. 

• Be honest with letters of recommendation.
You are not doing your reputation — or the 
world of science — any favours by writing a 
glowing recommendation for someone who 
is not up to the job. 

• Maintain your integrity. These days, it is
easier than ever to take shortcuts with data, 
images or text. Even a hint of misconduct 
can ruin a scientist’s reputation, with good 
reason. To maintain a reputation as a 
reliable and respected researcher, you will 
have to make sure that every aspect of your 
work adheres to accepted guidelines. C.W. 


media. Some do not even bother to have a 
website. “If you don’t create your own online 
reputation, other people are going to do that 
for you,” she says. She thinks that researchers 
who stay away from social media are often 
apprehensive about making a public misstate- 
ment or getting into an argument that could 
jeopardize their standing or their careers. “I 
think that fear is unfounded,” she says. “I have a
list of things I will and won't talk about. As long 
as you’re arguing from a scientific perspective,
the positives always outweigh the negatives.” 

Barbour says that scientists should be 
especially cautious when reviewing another 
researcher's work publicly. All too often, 
researchers see review as an opportunity for 
snarky comments, sarcasm and generally 
unprofessional behaviour. “It can be very com- 
bative and personal,” she says. “Everything you 
say post-publication reflects on you as a scien- 
tist. You should treat it the same way you would 
a pre-publication review.” 


THE SPREAD EFFECT 

Building name recognition can have a cascad- 
ing effect on a researcher's career. Petersen was 
the lead author of a study that found that a
single ‘big name’ on a paper can significantly 
boost citations, regardless of the merits of 
the content, especially early in the life of a 
publication. 

After calculating ‘reputation scores’ 
of 450 highly cited scientists, he and his 
colleagues found that a tenfold increase in 
reputation increased the citations by 66%, but 
the effect wore off after a large number of
citations. “Reputation aggregates,” Petersen 
says. “Every citation spreads your name a 
little bit more. If you strike it big, you can 
strike it very big.”
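
To get a feel for the 66% figure, one can assume, purely for illustration, that citations scale as a power of reputation. The article does not describe the study's model, so the power-law form and the function below are assumptions. They also ignore the wearing-off effect that the study reports for highly cited papers.

```python
import math

# Reported effect: a tenfold increase in reputation raised citations by 66%.
BOOST_PER_TENFOLD = 1.66

# Under the assumed power law, citations ~ reputation ** exponent,
# so 10 ** exponent must equal 1.66.
exponent = math.log10(BOOST_PER_TENFOLD)


def citation_multiplier(reputation_ratio: float) -> float:
    """Factor by which citations change when reputation changes by the given ratio."""
    return reputation_ratio ** exponent


doubling = citation_multiplier(2)      # effect of doubling reputation
hundredfold = citation_multiplier(100)  # two successive tenfold steps
```

Under this assumption the exponent is about 0.22, so doubling a reputation score would raise citations by roughly 16%, far less than proportionally.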

Some young researchers try to exploit the 
big-name effect by collaborating with emi- 
nent names in their field, even if it means 
taking a spot far down on the list of authors. 
But there are downsides for people who 
pursue the coat-tail effect. Santo Fortunato, 
one of Petersen's co-authors and a statisti- 
cian and social scientist at Aalto University 
in Finland, notes that famous scientists do 
not necessarily have much time to offer 
extensive assistance with a paper or anything
else. “You should be careful in choosing
your co-authors, but realize that a name
itself is not enough for a paper to be really 
successful,” he says. “Quality work is still the 
best statement you can make.” 

Early-career researchers might think 
— with some reason — that they can get 
the best head start on creation of a positive 
reputation and on their career by earning 
their PhD at a big-name university. A 2014 
study published in Science Advances found
that one-quarter of institutions accounted 
for 71-86% of all tenure-track hires in the 
fields of computer science, history and 
business. The authors conclude that insti- 
tutional prestige has an “enormous role” in 
faculty hiring across disciplines. 

Bourne says that this kind of “reputation 
by proxy” — the assumption that high- 
quality universities and high-quality labs 
tend to produce high-quality researchers 
— is pervasive in science. “Reputation rubs 
off,” he says. But young scientists should not
despair if they do not have a pedigree. As a
recipient of a PhD from Flinders University
in Adelaide, Australia, he is proof that one 
does not need to attend an illustrious insti- 
tution to go far in science. After a while, 
he says, accomplishments start to matter 
more than the education section of a CV. “I
always amuse myself because nobody's ever 
heard of the university I went to,” he says. 

Bik, who is originally from Boston, Mas- 
sachusetts, earned her PhD at the Univer- 
sity of Southampton, UK. 

She completed a postdoc in a big-name 
lab — Jonathan Eisen’s evolutionary-biol- 
ogy lab at the University of California, 
Davis — but for the most part she has had 
to build her reputation the way that most 
scientists do it: one paper, one conference 
and one tweet at a time.


Chris Woolston is a freelance writer in 
Billings, Montana. 
Visual maps bring
research to life 


Find the story in the science, says Asmund Eikenes. 


Research projects do not always follow
a linear narrative — but papers, grant
proposals and conference talks need
to do just that. Early-career scientists can use 
narrative techniques to create what I call a ‘storymap’:
a visual model of their research that
helps them to organize their thoughts and to 
tell a clear, compelling story about their work. 

Creating a storymap 
helps the researcher to 
evaluate the strengths 
and weaknesses of the 
project critically. I use 
this technique to visu- 
alize my ongoing pro- 
jects and to improve my 
understanding of the 
content and direction of 
my research. There are 
several ways to make a 
storymap, whether on 
a blackboard, white- 
board or on the web 
using tools such as Prezi 
(www.prezi.com). The
process is similar to that 
used by a detective, who pins notes and pictures 
to a wall as she or he maps out a case.

A researcher can instead use drawings, 
schematics and preliminary figure panels 
to map out a project. The process of sorting 
microscope images, graphs and diagrams into 
the storymap provides an instant overview of 
the project's status. 


A WEB OF IDEAS
Although an experimental strategy might 
work well for one aspect of a research project, 
other parts might need a different approach. 
The initial research question can also change 
over time. The storymap helps to clarify 
these diverging paths early on, and allows the 
researcher to consider alternative strategies. If a
scientist maintains the storymap on a shareable
online platform such as Google+, he or she can 
also communicate the results of ongoing inves- 
tigations in collaborative projects, and everyone 
involved can interact with the storymap content 
in real time. Graduate students, postdoctoral 
researchers and principal investigators can 
discuss the overview online and keep up with 
the progression of the research visually. 
Historically, researchers have planned their 
work using long lists in lab notebooks or on
PowerPoint slides. Storymapping, however, 
provides a clear visual reference for exploring 
potential new directions. Based on that insight, 
researchers can decide what supporting 
experimentation they need to pursue. 

The visual model also lets the researcher see 
missing pieces of the project that need atten- 
tion. This is instru- 
mental for thoroughly 
understanding the 
data, and for narrating 
the project in a clear 
and effective way. 

Storymapping can 
also help to guide the 
manuscript writing 
process. A well-written 
paper engages readers 
with a logical flow of 
intriguing questions 
and well-supported 
answers throughout 
the text. The storymap highlights the questions
and answers that anchor the project, and so serves as a guide
for describing the results. 

I let the visual overview help me to articulate
the sentences that serve to transition between 
results — the glue of the story. This approach 
helps me to produce the framework of the 
manuscript; these paragraphs will guide the 
narration of the experimental work. I start 
with the sentences that will link each section 
of the story, creating a structured backbone 
that I can use to build the first draft. After this, 
I find it much easier to fill in the gaps, akin to 
colouring inside the lines in a drawing book. 

This structure places the data into context 
and shapes the manuscript into a smooth 
progression of results that trigger insightful 
questions, which the paper offers specific 
experimental ways to address. 

Over the past four years, I have employed 
these storymapping techniques both alone 
and with colleagues, and find that they have 
significantly contributed to my development 
as a scientist. Using storytelling as a tool 
while working on a project facilitates critical 
thinking, and so enhances the scientific work.


Asmund Eikenes is a science writer in Oslo. 
SCIENCE FICTION


TEMPUS OMNIA REVELAT 


BY TIAN LI 


“Hey, Geek-boy, what’s the new
episode of Rome like?” My roommate
Scott strolls into the sitting
room and flops beside me on the couch. He
leans over to get a clearer view of my laptop. 

“It’s only the raw cut, but I’d say it’s as good
as the earlier ones,” I reply. 

I’m lucky. Thanks to my language and 
history studies I get privileged access to 
the undubbed episodes of Rome at least 
24 hours before the edit is broadcast. 

Scott shrugs. I think he’s losing inter- 
est in my favourite series. When Rome 
first started, he watched the undubbed 
episodes with me, listening to my rapid 
translation of the dialogue. He even 
boasted to his new girlfriend that he 
could understand all the Latin emanat- 
ing from the screen — although his confi- 
dence got a bit dented when I told him that 
some of the time they were speaking Greek. 

But over these past few weeks, his eager- 
ness has subsided. First he stopped watching 
the show with me, waiting instead for the 
English dubbed version to become available. 
And last week, I even caught him watching 
the old series of the same name that was 
made back in 2005. 

“I didn’t like last week’s episode,” Scott
says, confirming my fears. “When I saw 
Cleopatra I was shocked. I mean, how could 
such a woman have charmed both Julius 
Caesar and Mark Antony? Let’s face it, she 
was no Elizabeth Taylor or Vivien Leigh.” 

I sigh. I'd heard other people say the same 
thing, and the audience reports I'd seen 
echoed Scott’s sentiments. Even though 
some viewers were impressed by Cleopatra's 
wisdom or were captivated by her splendid 
jewels and robes, almost everyone agreed 
that she wasn't the Cleopatra of their imagi- 
nation. Clearly the face of the world was 
indifferent to the length of her nose. 

Maybe that was why ratings had slumped, 
with last week’s episode earning pretty much 
the lowest score possible. 

“Well, you won’t have to see her any more,”
I say, trying to keep the frustration out of my
voice. “Rome has been cancelled.”
Scott’s eyes widen a little. “You're kidding! 
Why?” 
Maybe he’s feign- 
A historical perspective.


work,” I say with a shrug. “Not enough 
people were watching it.”

The fact that Rome had to be broadcast in 
unfashionable 2D hadn't helped — but what 
could the audience expect? It isn’t easy to get 
a clean signal from 2,200 years ago. 


But I knew that wasn’t the only reason for
the programme’s failure. Many viewers found
the gladiatorial games and the images of slave 
torture revolting. Scenes that the producers 
had expected to be ratings gold had turned 
out to last too long and be far too gory. 

I’d even seen viewer complaints about
Caesar, arguing that the series should show
more about his ‘teenage years and personal
life’, so that they could tap into ‘the other side
of Caesar’. But that’s not how real life works
— it is what it is.

“Too bad,” Scott mutters. “I thought there
would be a second season.” 

I glance at him, surprised. “I didn't think 
you were a fan.” 

He smiles. “Well, it might be a touch way- 
ward, but at least it tells the truth — and it 
gives us a chance to resolve a few histori- 
cal mysteries. I actually wanted to find out 
why Cleopatra left Antony at the Battle of 
Actium. I mean, think of The White Queen, 
basically we watched season 2 to find out 
what really happened to Edward V.”

I grin. It was true that the unfortunate 
young king had saved that programme’s 
ratings. 

“But do you really want the truth?” I 
ask. “Let’s face it, the truth tends to be very 
different from what you imagine — look at
Cleopatra.” 

Scott nods. “Maybe you’re right.”

It's a shame: these historical documentaries 
are more than mere entertainment. They dis- 
close truths long-hidden by the veil of the past. 
I know that one day they will be acclaimed and 
valued, although maybe not today. 

Suddenly, Scott turns to me. “Hey, you've 
just watched the next episode of Rome 
online! So it exists, which means it will be 
on TV, right?” 

I shake my head. “Rome’s been cancelled. 
There will be no more episodes on TV.” I
pause. “But maybe it can be saved...” 
Scott shoots me a questioning glance. 
“Well,” I say, warming to my theme, 
“the costs for the series are relatively low: 
no fees for actors, no money needed for 
costumes or scenery. HBO could air 
the rest of the episodes online — we 
could still see Caesar, Antony and Gaius 
Octavius. You may yet find out what really 
happened during the Battle of Actium.”
Scott’s eyes light up. “You mean we could 
get season 2?” 

“I guess. But the online episodes will be
just the raw versions. No overdubs or official 
English subtitles.” 

“Could you translate the dialogue?” Scott 
looks at me and I see in his eyes the same 
excitement that was there when he watched 
the first episode. 

“Better than that. I will subtitle them in 
English online,” I grin.

Subtitling episodes is the least I can do for 
the people of this era. The myriad tongues I 
know would make no sense to them — after 
all, to them I’m just another language geek. 
But I have the right connections at HBO. 
And if I do the translations, at least some 
viewers will learn something. And watch- 
ing Scott discover more about Rome — well, 
that will be entertainment in itself. 

I hope the ratings of my show, Adven- 
tures of a History Geek, are better than those 
for Rome. I need to pay off the loan for my 
hovership when I get back to my time. 

Tomorrow, I’ll persuade Scott to buy a 
nice new shirt. If he continues to look good, 
his popularity can only increase. Even 
though he'll never know that he’s on TV, I'm 
going to make him a star.


Tian Li lives in China, and received her 
master’s degree in biology at Tsinghua 
University in 2012. She loves detective and 
science-fiction novels. 